GROUP B STREPTOCOCCUS ANTIGENS

FIELD OF THE INVENTION 

The present invention relates to antigens, more
particularly protein antigens of the group B streptococcus
(GBS) bacterial pathogen, which are useful as vaccine
components for therapy and/or prophylaxis.






BACKGROUND OF THE INVENTION 

Streptococci are gram-positive bacteria that are
differentiated by group-specific carbohydrate antigens A
through O found on their cell surface. Streptococcus groups
are further distinguished by type-specific capsular
polysaccharide antigens. Several serotypes have been
identified for the Group B streptococcus (GBS): Ia, Ib, II,
III, IV, V, VI, VII and VIII. GBS also contains antigenic
proteins known as "C-proteins" (alpha, beta, gamma and
delta), some of which have been cloned.

Although GBS is a common component of the normal human
vaginal and colonic flora, this pathogen has long been
recognized as a major cause of neonatal sepsis and
meningitis, late-onset meningitis in infants, postpartum
endometritis as well as mastitis in dairy herds. Expectant
mothers exposed to GBS are at risk of postpartum infection
and may transfer the infection to their baby as the child
passes through the birth canal. Although the organism is
sensitive to antibiotics, the high attack rate and rapid
onset of sepsis in neonates and meningitis in infants
result in high morbidity and mortality.

To find a vaccine that will protect individuals from GBS
infection, researchers have turned to the type-specific
antigens. Unfortunately these polysaccharides have proven to
be poorly immunogenic in humans and are restricted to the
particular serotype from which the polysaccharide
originates. Further, capsular polysaccharides elicit a T
cell independent response i.e. no IgG production.
Consequently capsular polysaccharide antigens are unsuitable
as a vaccine component for protection against GBS infection.

Others have focused on the C-protein beta antigen, which
demonstrated immunogenic properties in mouse and rabbit
models. This protein was found to be unsuitable as a human
vaccine because of its undesirable property of interacting
with high affinity and in a non-immunogenic manner with the
Fc region of human IgA. The C-protein alpha antigen is rare
in type III serotypes of GBS, which is the serotype
responsible for most GBS mediated conditions, and is
therefore of little use as a vaccine component.

Therefore there remains an unmet need for GBS antigens that
may be used as vaccine components for the prophylaxis and/or
therapy of GBS infection.

SUMMARY OF THE INVENTION 

According to one aspect, the present invention provides an
isolated polynucleotide encoding a polypeptide having at
least 70% identity to a second polypeptide comprising a
sequence selected from the group consisting of:
SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5,
SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10,
SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 15,
SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19,
SEQ ID NO: 20, SEQ ID NO: 21, SEQ ID NO: 23, SEQ ID NO: 24,
SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 29,
SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 33, SEQ ID NO: 34,
SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 38, SEQ ID NO: 39,
SEQ ID NO: 40, SEQ ID NO: 41 and SEQ ID NO: 44 or fragments,
analogs or derivatives thereof.

In other aspects, there are provided vectors comprising
polynucleotides of the invention operably linked to an
expression control region, as well as host cells transfected
with said vectors and methods of producing polypeptides
comprising culturing said host cells under conditions
suitable for expression.

In yet another aspect, there are provided novel polypeptides
encoded by polynucleotides of the invention.



BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1a is the DNA sequence of clone 1 (SEQ ID NO: 1) with
corresponding amino acid sequences for open reading frames;
figure 1b is the amino acid sequence SEQ ID NO: 2;
figure 1c is the amino acid sequence SEQ ID NO: 3;
figure 1d is the amino acid sequence SEQ ID NO: 4;
figure 1e is the amino acid sequence SEQ ID NO: 5;
figure 1f is the amino acid sequence SEQ ID NO: 6;

Figure 2a is the DNA sequence of clone 2 (SEQ ID NO: 7) with
corresponding amino acid sequences for open reading frames;
figure 2b is the amino acid sequence SEQ ID NO: 8;
figure 2c is the amino acid sequence SEQ ID NO: 9;
figure 2d is the amino acid sequence SEQ ID NO: 10;
figure 2e is the amino acid sequence SEQ ID NO: 11;
figure 2f is the amino acid sequence SEQ ID NO: 12;





WO 99/42588 



PCT/CA99/00114 



Figure 3a is the DNA sequence of clone 3 (SEQ ID NO: 13)
with corresponding amino acid sequences for open reading
frames;

Figure 4a is the DNA sequence of clone 4 (SEQ ID NO: 22)
with corresponding amino acid sequences for open reading
frames;
figure 4b is the amino acid sequence SEQ ID NO: 23;
figure 4c is the amino acid sequence SEQ ID NO: 24;
figure 4d is the amino acid sequence SEQ ID NO: 25;
figure 4e is the amino acid sequence SEQ ID NO: 26;

Figure 5a is the DNA sequence of clone 5 (SEQ ID NO: 27)
with corresponding amino acid sequences for open reading
frames;
figure 5b is the amino acid sequence SEQ ID NO: 28;
figure 5c is the amino acid sequence SEQ ID NO: 29;
figure 5d is the amino acid sequence SEQ ID NO: 30;
figure 5e is the amino acid sequence SEQ ID NO: 31;

Figure 6a is the DNA sequence of clone 6 (SEQ ID NO: 32);
figure 6b is the amino acid sequence SEQ ID NO: 33;
figure 6c is the amino acid sequence SEQ ID NO: 34;
figure 6d is the amino acid sequence SEQ ID NO: 35;
figure 6e is the amino acid sequence SEQ ID NO: 36;

Figure 7a is the DNA sequence of clone 7 (SEQ ID NO: 37);
figure 7b is the amino acid sequence SEQ ID NO: 38;
figure 7c is the amino acid sequence SEQ ID NO: 39;
figure 7d is the amino acid sequence SEQ ID NO: 40;
figure 7e is the amino acid sequence SEQ ID NO: 41;

Figure 8 is the DNA sequence of a part of clone 7 including
a signal sequence (SEQ ID NO: 42);

Figure 9 is the DNA sequence of a part of clone 7 without a
signal sequence (SEQ ID NO: 43);
Figure 9a is the amino acid sequence (SEQ ID NO: 44);

Figure 10 represents the distribution of anti-GBS ELISA
titers in sera from CD-1 mice immunized with recombinant GBS
protein corresponding to the SEQ ID NO: 39.



DETAILED DESCRIPTION OF THE INVENTION

The present invention relates to novel antigenic
polypeptides of group B streptococcus (GBS) characterized by
the amino acid sequence selected from the group consisting
of:
SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5,
SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10,
SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 15,
SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19,
SEQ ID NO: 20, SEQ ID NO: 21, SEQ ID NO: 23, SEQ ID NO: 24,
SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 29,
SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 33, SEQ ID NO: 34,
SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 38, SEQ ID NO: 39,
SEQ ID NO: 40, SEQ ID NO: 41 and SEQ ID NO: 44 or fragments,
analogs or derivatives thereof.

A preferred embodiment of the invention includes SEQ ID
NO: 39 and SEQ ID NO: 44.

A further preferred embodiment of the invention is SEQ ID
NO: 39.

A further preferred embodiment of the invention is SEQ ID
NO: 44.

As used herein, "fragments", "derivatives" or "analogs" of
the polypeptides of the invention include those polypeptides
in which one or more of the amino acid residues are
substituted with a conserved or non-conserved amino acid
residue (preferably conserved) and which may be natural or
unnatural.

The terms "fragments", "derivatives" or "analogues" of
polypeptides of the present invention also include
polypeptides which are modified by addition, deletion or
substitution of amino acids provided that the polypeptides
retain the capacity to induce an immune response.

By the term "conserved amino acid" is meant a substitution
of one or more amino acids for another in which the
antigenic determinant (including its secondary structure and
hydropathic nature) of a given antigen is completely or
partially conserved in spite of the substitution.

For example, one or more amino acid residues within the
sequence can be substituted by another amino acid of a
similar polarity, which acts as a functional equivalent,
resulting in a silent alteration. Substitutes for an amino
acid within the sequence may be selected from other members
of the class to which the amino acid belongs. For example,
the nonpolar (hydrophobic) amino acids include alanine,
leucine, isoleucine, valine, proline, phenylalanine,
tryptophan and methionine. The polar neutral amino acids
include glycine, serine, threonine, cysteine, tyrosine,
asparagine and glutamine. The positively charged (basic)
amino acids include arginine, lysine and histidine. The
negatively charged (acidic) amino acids include aspartic
acid and glutamic acid.
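
The class-based rule described above can be illustrated with a short sketch. It assumes standard one-letter amino acid codes and treats a substitution as conserved only when both residues fall in the same one of the four classes listed above. The names `AMINO_ACID_CLASSES`, `residue_class` and `is_conserved_substitution` are hypothetical and chosen for illustration only; they do not come from the specification.

```python
# Amino acid classes exactly as grouped in the paragraph above,
# written with standard one-letter codes.
AMINO_ACID_CLASSES = {
    "nonpolar": set("ALIVPFWM"),      # Ala, Leu, Ile, Val, Pro, Phe, Trp, Met
    "polar_neutral": set("GSTCYNQ"),  # Gly, Ser, Thr, Cys, Tyr, Asn, Gln
    "basic": set("RKH"),              # Arg, Lys, His
    "acidic": set("DE"),              # Asp, Glu
}


def residue_class(residue):
    """Return the class name for a one-letter amino acid code."""
    code = residue.upper()
    for name, members in AMINO_ACID_CLASSES.items():
        if code in members:
            return name
    raise ValueError(f"unknown amino acid code: {residue!r}")


def is_conserved_substitution(original, replacement):
    """True when both residues belong to the same class.

    This same-class test is an illustrative assumption, not a
    definition taken from the specification.
    """
    return residue_class(original) == residue_class(replacement)
```

Under this sketch, replacing leucine with isoleucine counts as conserved, whereas replacing lysine with aspartic acid does not.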

Preferably, derivatives and analogs of polypeptides of the
invention will have about 70% identity with those sequences
illustrated in the figures or fragments thereof. That is,
70% of the residues are the same. More preferably
polypeptides will have greater than 95% homology. In another
preferred embodiment, derivatives and analogs of
polypeptides of the invention will have fewer than about 20
amino acid residue substitutions, modifications or deletions
and more preferably fewer than 10. Preferred substitutions
are those known in the art as conserved i.e. the substituted
residues share physical or chemical properties such as
hydrophobicity, size, charge or functional groups.
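
The identity figure above ("70% of the residues are the same") can be made concrete with a small calculation. The sketch below assumes the two sequences have already been aligned to equal length, with "-" marking a gap, and divides the number of matching columns by the alignment length. The specification does not prescribe an alignment method or a denominator, so both are assumptions, and the function names and example sequences are hypothetical.

```python
def percent_identity(aligned_a, aligned_b):
    """Percentage of alignment columns holding the same residue.

    Assumes both inputs are pre-aligned to the same length and use
    "-" for gaps; a gap never counts as a match.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    if not aligned_a:
        raise ValueError("sequences must not be empty")
    matches = sum(
        1 for a, b in zip(aligned_a, aligned_b) if a == b and a != "-"
    )
    return 100.0 * matches / len(aligned_a)


def count_differences(aligned_a, aligned_b):
    """Number of columns that differ (substitutions or deletions)."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(1 for a, b in zip(aligned_a, aligned_b) if a != b)


# Arbitrary placeholder sequences, not taken from any SEQ ID NO.
reference = "MKTAYIAKQR"
variant = "MKTAYLAKQK"
identity = percent_identity(reference, variant)
differences = count_differences(reference, variant)
```

Here the variant differs from the reference at two of ten positions, so it would satisfy an "about 70% identity" criterion and a "fewer than 10 substitutions" criterion, but not a "greater than 95%" criterion.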




Furthermore, in those situations where amino acid regions 
are found to be polymorphic, it may be desirable to vary one 
or more particular amino acids to more effectively mimic the 
different epitopes of the different GBS strains. 

Also included are polypeptides which have fused thereto
other compounds which alter the polypeptides' biological or
pharmacological properties i.e. polyethylene glycol (PEG) to
increase half-life; leader or secretory amino acid sequences
for ease of purification; prepro- and pro-sequences; and
(poly)saccharides.

Moreover, the polypeptides of the present invention can be
modified by terminal -NH2 acylation (e.g. by acetylation, or
thioglycolic acid amidation, terminal carboxy amidation,
e.g. with ammonia or methylamine) to provide stability,
increased hydrophobicity for linking or binding to a support
or other molecule.



Also contemplated are hetero and homo polypeptide multimers
of the polypeptide fragments, analogues and derivatives.
These polymeric forms include, for example, one or more
polypeptides that have been cross-linked with cross-linkers
such as avidin/biotin, glutaraldehyde or
dimethylsuberimidate. Such polymeric forms also include
polypeptides containing two or more tandem or inverted
contiguous sequences, produced from multicistronic mRNAs
generated by recombinant DNA technology.

Preferably, a fragment, analog or derivative of a
polypeptide of the invention will comprise at least one
antigenic region i.e. at least one epitope.



In order to achieve the formation of antigenic polymers
(i.e. synthetic multimers), polypeptides may be utilized
having bishaloacetyl groups, nitroarylhalides, or the like,
where the reagents are specific for thio groups.
Therefore, the link between two mercapto groups of the
different peptides may be a single bond or may be composed
of a linking group of at least two, typically at least four,
and not more than 16, but usually not more than about 14
carbon atoms.

In a particular embodiment, polypeptide fragments, analogs
and derivatives of the invention do not contain a methionine
(Met) starting residue. Preferably, polypeptides will not
incorporate a leader or secretory sequence (signal
sequence). The signal portion of a polypeptide of the
invention may be determined according to established
molecular biological techniques. In general, the
polypeptide of interest may be isolated from a GBS culture
and subsequently sequenced to determine the initial residue
of the mature protein and therefore the sequence of the
mature polypeptide.

According to another aspect, there are provided vaccine
compositions comprising one or more GBS polypeptides of the
invention in admixture with a pharmaceutically acceptable
carrier, diluent or adjuvant.

Suitable adjuvants include oils i.e. Freund's complete or
incomplete adjuvant; salts i.e. AlK(SO4)2, AlNa(SO4)2,
AlNH4(SO4)2, Al(OH)3, AlPO4, silica, kaolin; saponin
derivative; carbon polynucleotides i.e. poly IC and poly AU
and also detoxified cholera toxin (CTB) and E. coli heat
labile toxin for induction of mucosal immunity. Preferred
adjuvants include QuilA™, Alhydrogel™ and Adjuphos™.
Vaccines of the invention may be administered parenterally
by injection, rapid infusion, nasopharyngeal absorption,
dermoabsorption, or buccally or orally.




Vaccine compositions of the invention are used for the
treatment or prophylaxis of streptococcus infection and/or
diseases and symptoms mediated by streptococcus infection,
in particular group A streptococcus (pyogenes), group B
streptococcus (GBS or agalactiae), dysgalactiae, uberis,
nocardia as well as Staphylococcus aureus, and more
particularly group B streptococcus, agalactiae. General
information about Streptococcus is available in Manual of
Clinical Microbiology by P.R. Murray et al. (1995, 6th
Edition, ASM Press, Washington, D.C.). In a particular
embodiment vaccines are administered to those individuals at
risk of GBS infection such as pregnant women and infants for
sepsis, meningitis and pneumonia as well as
immunocompromised individuals such as those with diabetes,
liver disease or cancer. Vaccines may also have veterinary
applications such as for the treatment of mastitis in
cattle which is mediated by the above mentioned bacteria as
well as E. coli.






The vaccine of the present invention can also be used for
the manufacture of a medicament used for the treatment or
prophylaxis of streptococcus infection and/or diseases and
symptoms mediated by streptococcus infection, in particular
group A streptococcus (pyogenes), group B streptococcus (GBS
or agalactiae), dysgalactiae, uberis, nocardia as well as
Staphylococcus aureus. More particularly group B
streptococcus, agalactiae.



Vaccine compositions are preferably in unit dosage form of
about 0.001 to 100 µg/kg (antigen/body weight), more
preferably 0.01 to 10 µg/kg and most preferably 0.1 to 1
µg/kg, administered 1 to 3 times with an interval of about
1 to 12 weeks between immunizations, and more preferably 1
to 6 weeks.
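
The weight-based ranges above translate into absolute antigen amounts by simple multiplication. The following sketch shows that arithmetic only; the tier labels, the function name `antigen_dose_range_ug` and the example body weight are hypothetical, and the sketch makes no assumption about which tier suits any particular subject.

```python
# Dose ranges in micrograms of antigen per kilogram of body weight,
# as stated in the paragraph above. The tier labels are illustrative.
DOSE_RANGES_UG_PER_KG = {
    "broad": (0.001, 100.0),
    "preferred": (0.01, 10.0),
    "most_preferred": (0.1, 1.0),
}


def antigen_dose_range_ug(body_weight_kg, tier="preferred"):
    """Return (low, high) antigen amounts in micrograms per dose."""
    if body_weight_kg <= 0:
        raise ValueError("body weight must be positive")
    low, high = DOSE_RANGES_UG_PER_KG[tier]
    return low * body_weight_kg, high * body_weight_kg


# Example: a hypothetical 70 kg subject at the most preferred range.
low_ug, high_ug = antigen_dose_range_ug(70.0, "most_preferred")
```

For a 70 kg subject the most preferred range of 0.1 to 1 µg/kg works out to roughly 7 to 70 µg of antigen per dose.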

According to another aspect, there are provided
polynucleotides encoding polypeptides of group B
streptococcus (GBS) characterized by the amino acid sequence
selected from the group consisting of:
SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5,
SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10,
SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 15,
SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19,
SEQ ID NO: 20, SEQ ID NO: 21, SEQ ID NO: 23, SEQ ID NO: 24,
SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 29,
SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 33, SEQ ID NO: 34,
SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 38, SEQ ID NO: 39,
SEQ ID NO: 40, SEQ ID NO: 41 and SEQ ID NO: 44 or fragments,
analogs or derivatives thereof.

Preferred polynucleotides are those illustrated in figures
1a (SEQ ID NO: 1), 2a (SEQ ID NO: 7), 3a (SEQ ID NO: 13), 4a
(SEQ ID NO: 22), 5a (SEQ ID NO: 27), 6a (SEQ ID NO: 32), 7a
(SEQ ID NO: 37), 8 (SEQ ID NO: 42) and 9 (SEQ ID NO: 43)
which correspond to the open reading frames, encoding
polypeptides of the invention.

Preferred polynucleotides are those illustrated in figures
1a (SEQ ID NO: 1), 2a (SEQ ID NO: 7), 3a (SEQ ID NO: 13), 4a
(SEQ ID NO: 22), 5a (SEQ ID NO: 27), 6a (SEQ ID NO: 32), 7a
(SEQ ID NO: 37), 8 (SEQ ID NO: 42) and 9 (SEQ ID NO: 43)
and fragments, analogues and derivatives thereof.

More preferred polynucleotides of the invention are those
illustrated in Figures 7 (SEQ ID NO: 37), 8 (SEQ ID NO: 42)
and 9 (SEQ ID NO: 43).

Most preferred polynucleotides of the invention are those
illustrated in Figures 8 (SEQ ID NO: 42) and 9 (SEQ ID NO:
43).

It will be appreciated that the polynucleotide sequences
illustrated in the figures may be altered with degenerate
codons yet still encode the polypeptides of the invention.

Due to the degeneracy of nucleotide coding sequences, other
polynucleotide sequences which encode for substantially the
same polypeptides of the present invention may be used in
the practice of the present invention. These include but are
not limited to nucleotide sequences which are altered by the
substitution of different codons that encode the same amino
acid residue within the sequence, thus producing a silent
change.
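
The silent change described above can be demonstrated by translating two different nucleotide sequences and comparing the results. The sketch below builds the standard genetic code table and translates in a single reading frame. The use of the standard table, the function name `translate` and the two short example sequences are assumptions for illustration; the sequences are arbitrary placeholders and are not taken from any SEQ ID NO.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order with the
# third base varying fastest; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a coding DNA sequence read in frame from its first base."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of three")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# Two placeholder sequences that differ at the third base of two codons.
original = "ATGAAAGCT"
altered = "ATGAAGGCC"
same_peptide = translate(original) == translate(altered)
```

Both placeholder sequences translate to the same three-residue peptide even though their nucleotide sequences differ, which is the silent change the paragraph refers to.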

Accordingly the present invention further provides
polynucleotides which hybridize to the polynucleotide
sequences herein above described (or the complement
sequences thereof) having 50%, and preferably at least 70%,
identity between sequences. More preferably polynucleotides
are hybridizable under stringent conditions i.e. having at
least 95% identity and most preferably more than 97%
identity.

By capable of hybridizing under stringent conditions is
meant annealing of a nucleic acid molecule to at least a
region of a second nucleic acid sequence (whether as cDNA,
mRNA, or genomic DNA) or to its complementary strand under
standard conditions, e.g. high temperature and/or low salt
content, which tend to disfavor hybridization of
noncomplementary nucleotide sequences. A suitable protocol
is described in Maniatis T. et al., Molecular Cloning: A
Laboratory Manual, Cold Spring Harbor Laboratory, 1982,
which is herein incorporated by reference.

In a further aspect, polynucleotides encoding polypeptides
of the invention, or fragments, analogs or derivatives
thereof, may be used in a DNA immunization method.
That is, they can be incorporated into a vector which is
replicable and expressible upon injection thereby producing
the antigenic polypeptide in vivo. For example
polynucleotides may be incorporated into a plasmid vector
under the control of the CMV promoter which is functional in
eukaryotic cells. Preferably the vector is injected
intramuscularly.

According to another aspect, there is provided a process for
producing polypeptides of the invention by recombinant
techniques by expressing a polynucleotide encoding said
polypeptide in a host cell and recovering the expressed
polypeptide product. Alternatively, the polypeptides can be
produced according to established synthetic chemical
techniques i.e. solution phase or solid phase synthesis of
oligopeptides which are ligated to produce the full
polypeptide (block ligation).

For recombinant production, host cells are transfected with
vectors which encode the polypeptide, and then cultured in a
nutrient media modified as appropriate for activating
promoters, selecting transformants or amplifying the genes.
Suitable vectors are those that are viable and replicable in
the chosen host and include chromosomal, non-chromosomal and
synthetic DNA sequences e.g. bacterial plasmids, phage DNA,
baculovirus, yeast plasmids, vectors derived from
combinations of plasmids and phage DNA. The
polypeptide-encoding sequence may be incorporated in the
vector at the appropriate site using restriction enzymes
such that it is operably linked to an expression control
region comprising a promoter, ribosome binding site
(consensus region or Shine-Dalgarno sequence), and
optionally an operator (control element). One can select
individual components of the
expression control region that are appropriate for a given
host and vector according to established molecular biology
principles (Sambrook et al, Molecular Cloning: A Laboratory
Manual, 2nd ed., Cold Spring Harbor, N.Y., 1989 incorporated
herein by reference). Suitable promoters include but are not
limited to LTR or SV40 promoter, E. coli lac, tac or trp
promoters and the phage lambda PL promoter. Vectors will
preferably incorporate an origin of replication as well as
selection markers i.e. ampicillin resistance gene. Suitable
bacterial vectors include pET, pQE70, pQE60, pQE-9, pbs,
pD10, phagescript, psiX174, pbluescript SK, pbsks, pNH8A,
pNH16a, pNH18A, pNH46A, ptrc99a, pKK223-3, pKK233-3, pDR540,
pRIT5 and eukaryotic vectors pBlueBacIII, pWLNEO, pSV2CAT,
pOG44, pXT1, pSG, pSVK3, pBPV, pMSG and pSVL. Host cells
may be bacterial i.e. E. coli, Bacillus subtilis,
Streptomyces; fungal i.e. Aspergillus niger, Aspergillus
nidulans; yeast i.e. Saccharomyces or eukaryotic i.e. CHO,
COS.

Upon expression of the polypeptide in culture, cells are
typically harvested by centrifugation then disrupted by
physical or chemical means (if the expressed polypeptide is
not secreted into the media) and the resulting crude extract
retained to isolate the polypeptide of interest.
Purification of the polypeptide from culture media or lysate
may be achieved by established techniques depending on the
properties of the polypeptide i.e. using ammonium sulfate or
ethanol precipitation, acid extraction, anion or cation
exchange chromatography, phosphocellulose chromatography,
hydrophobic interaction chromatography, hydroxylapatite
chromatography and lectin chromatography. Final
purification may be achieved using HPLC.

The polypeptide may be expressed with or without a leader or
secretion sequence. In the former case the leader may be
removed using post-translational processing (see US
4,431,739; 4,425,437; and 4,338,397 incorporated herein by
reference) or be chemically removed subsequent to purifying
the expressed polypeptide.

According to a further aspect, the GBS polypeptides of the
invention may be used in a diagnostic test for streptococcus
infection, in particular GBS infection. Several diagnostic
methods are possible. For example, to detect a streptococcus
organism in a biological sample, the following procedure may
be followed:

a) obtaining a biological sample from a patient;

b) incubating an antibody or fragment thereof reactive
with a GBS polypeptide of the invention with the
biological sample to form a mixture; and

c) detecting specifically bound antibody or bound fragment
in the mixture which indicates the presence of
streptococcus.

Alternatively, a method for the detection of antibody
specific to a streptococcus antigen in a biological sample
containing or suspected of containing said antibody may be
performed as follows:

a) isolating a biological sample from a patient;

b) incubating one or more GBS polypeptides of the
invention or fragments thereof with the biological
sample to form a mixture; and

c) detecting specifically bound antigen or bound fragment
in the mixture which indicates the presence of antibody
specific to streptococcus.

One of skill in the art will recognize that this diagnostic
test may take several forms, including an immunological test
such as an enzyme-linked immunosorbent assay (ELISA), a
radioimmunoassay or a latex agglutination assay, essentially
to determine whether antibodies specific for the protein are
present in an organism.

The DNA sequences encoding polypeptides of the invention may
also be used to design DNA probes for use in detecting the
presence of streptococcus in a biological sample suspected
of containing such bacteria. The detection method of this
invention comprises:

a) isolating the biological sample from a patient;

b) incubating one or more DNA probes having a DNA sequence
encoding a polypeptide of the invention or fragments
thereof with the biological sample to form a mixture;
and

c) detecting specifically bound DNA probe in the mixture
which indicates the presence of streptococcus bacteria.

The DNA probes of this invention may also be used for
detecting circulating streptococcus i.e. GBS nucleic acids
in a sample, for example using a polymerase chain reaction,
as a method of diagnosing streptococcus infections. The
probe may be synthesized using conventional techniques and
may be immobilized on a solid phase, or may be labeled with
a detectable label. A preferred DNA probe for this
application is an oligomer having a sequence complementary
to at least about 6 contiguous nucleotides of a sequence
encoding the GBS polypeptides of the invention.
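
An oligomer complementary to a stretch of contiguous nucleotides can be derived by taking the reverse complement of that stretch. The sketch below enumerates every such candidate of a chosen length along a target sequence. The function names, the default length of 6 and the target sequence are hypothetical; the target is an arbitrary placeholder rather than any sequence from the figures, and the sketch does not address probe specificity or melting temperature.

```python
# Watson-Crick pairing for DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    seq = seq.upper()
    if set(seq) - set("ACGT"):
        raise ValueError("sequence may contain only A, C, G and T")
    return seq.translate(COMPLEMENT)[::-1]


def candidate_probes(target, length=6):
    """List oligomers complementary to each contiguous window of the target."""
    if length < 1 or length > len(target):
        raise ValueError("length must be between 1 and the target length")
    return [
        reverse_complement(target[i:i + length])
        for i in range(len(target) - length + 1)
    ]


# Placeholder target, not taken from any SEQ ID NO.
probes = candidate_probes("ATGAAAG", length=6)
```

Each returned oligomer, written 5' to 3', pairs with one six-nucleotide window of the placeholder target.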



Another diagnostic method for the detection of streptococcus
in a patient comprises:

a) labeling an antibody reactive with a polypeptide of the
invention or fragment thereof with a detectable label;

b) administering the labeled antibody or labeled fragment
to the patient; and

c) detecting specifically bound labeled antibody or
labeled fragment in the patient which indicates the
presence of streptococcus.



A further aspect of the invention is the use of the GBS
polypeptides of the invention as immunogens for the
production of specific antibodies for the diagnosis and in
particular the treatment of streptococcus infection.
Suitable antibodies may be determined using appropriate
screening methods, for example by measuring the ability of a
particular antibody to passively protect against
streptococcus infection in a test model. One example of an
animal model is the mouse model described in the examples
herein. The antibody may be a whole antibody or an
antigen-binding fragment thereof and may in general belong
to any immunoglobulin class. The antibody or fragment may be
of animal origin, specifically of mammalian origin and more
specifically of murine, rat or human origin. It may be a
natural antibody or a fragment thereof, or if desired, a
recombinant antibody or antibody fragment. The term
recombinant antibody or antibody fragment means an antibody
or antibody fragment which was produced using molecular
biology techniques. The antibody or antibody fragments may
be polyclonal, or preferably monoclonal. It may be specific
for a number of epitopes associated with the GBS
polypeptides but is preferably specific for one.

EXAMPLE 1 Murine model of lethal Group B Streptococcus (GBS)
infection

The mouse model of GBS infection is described in detail in
Lancefield et al (J Exp Med 142:165-179, 1975). GBS strain
C388/90 (clinical isolate obtained in 1990 from the
cerebrospinal fluid of a patient suffering from
meningitis, Children's Hospital of Eastern Ontario, Ottawa,
Canada) and NCS246 (National Center for Streptococcus,
Provincial Laboratory of Public Health for Northern Alberta,
Edmonton, Canada) were respectively serotyped as type Ia/c
and type II/R.

To increase their virulence, the GBS strains C388/90
(serotype Ia/c) and NCS246 (serotype II/R) were serially
passaged through mice as described previously (Lancefield et
al. J Exp Med 142:165-179, 1975). Briefly, the increase of
virulence was monitored using intraperitoneal inoculations
of serial dilutions of a subculture in Todd-Hewitt broth
obtained from either the blood or spleen of infected mice.
After the last passage, infected blood samples were used to
inoculate Todd-Hewitt broth. After an incubation of 2 hours
at 37°C with 7% CO2, glycerol at a final concentration of
10% (v/v) was added to the culture. The culture was then
aliquoted and stored at -80°C for use in GBS challenge
experiments. The number of cfu of GBS present in these
frozen samples was determined. The bacterial concentration
necessary to kill 100% (LD100) of the 18-week-old mice was
determined to be 3.5 x 10^5 and 1.1 x 10^5 respectively for
GBS strain C388/90 and NCS246, which corresponded to a
significant increase in virulence for both strains. Indeed,
the LD100 recorded before the passages for these two strains
was higher than 10^9 cfu.

In a bacterial challenge, a freshly thawed aliquot of a
virulent GBS strain was adjusted to the appropriate
bacterial concentration using Todd-Hewitt broth and 1 ml was
injected intraperitoneally to each female CD-1 mouse. The
mice used for the passive protection experiments were 6 to 8
weeks old, while the ones used for the active protection
experiments were approximately 18 weeks old at the time of
the challenge. All inocula were verified by colony counts.
Animals were observed for any sign of infection four times
daily for the first 48 h after challenge and then daily for
the next 12 days. At the end of that period, blood samples
were obtained from the survivors and frozen at -20°C. The
spleen obtained from each mouse that survived the challenge
was cultured in order to identify any remaining GBS.

EXAMPLE 2 Immunization and protection in mice with
formaldehyde killed whole GBS cells

Formaldehyde killed GBS whole cells were prepared according
to the procedures described in Lancefield et al (J Exp Med
142:165-179, 1975). Briefly, an overnight culture on sheep
blood agar plates (Quelab Laboratories, Montreal, Canada) of
a GBS strain was washed twice in PBS buffer
(phosphate-buffered saline, pH 7.2), adjusted to
approximately 3 x 10^9 cfu/mL and incubated overnight in PBS
containing 0.3% (v/v) formaldehyde. The killed GBS
suspension was washed with PBS and kept frozen at -80°C.

Female CD-1 mice, 6 to 8 weeks old (Charles River,
St-Constant, Quebec, Canada), were injected subcutaneously
three times at two-week intervals with 0.1 ml of
formaldehyde killed cells of GBS strain C388/90 (~6 x 10^7
GBS), or 0.1 ml of PBS for the control group. On the day
before the immunization, Alhydrogel™ (Superfos Biosector,
Frederikssund, Denmark) at a final concentration of 0.14 mg
or 0.21 mg of Al was added to these preparations and
incubated overnight at 4°C with agitation. Serum samples
were obtained from each mouse before the beginning of the
immunization protocol and two weeks after the last
injection. The sera were frozen at -20°C.

Eight mice in each of the control group injected with PBS
and the group immunized with formaldehyde killed whole cells
of GBS strain C388/90 (Ia/c) were challenged with 1.5 x 10^4
cfu of GBS strain C388/90 (Ia/c) one week after the third
injection. All mice immunized with the formaldehyde killed
GBS whole cells survived the homologous challenge while,
within 5 days after the challenge, only 4 out of the 8 mice
injected with PBS survived the infection. In order to
increase the mortality rate in the control groups, the
bacterial suspension had to be adjusted according to the age
of the mice at the time of the bacterial challenge. In
subsequent challenge experiments, when mice were older than
15 weeks, the bacterial inoculum was increased to
concentrations between 3.0 x 10^5 and 2.5 x 10^6 cfu.

Table 1   Immunization of CD-1 mice with formaldehyde
          killed whole cells of GBS and subsequent
          homologous challenge [strain C388/90 (Ia/c)]
          and heterologous challenge [strain NCS246 (II/R)]

                                number of living mice 14 days after
                                the bacterial challenge (% survival)
antigenic preparations          homologous challenge:   heterologous challenge:
used for immunization¹          strain C388/90 (Ia/c)   strain NCS246 (II/R)
-------------------------------------------------------------------------------
1st infection
  formaldehyde killed cells     8/8 (100)³              n.d.⁵
  of GBS strain C388/90
  (Ia/c)²
  control PBS                   4/8 (50)                n.d.
2nd infection
  formaldehyde killed cells     6/6 (100)⁴              0/6 (0)⁶
  of GBS strain C388/90
  (Ia/c)
  control PBS                   2/6 (33)                0/6 (0)

¹ Alhydrogel™ at a final concentration of 0.14 mg or 0.21 mg of Al was used;
² approximately 6 x 10^7 cfu;
³ intraperitoneal challenge with 1 mL Todd-Hewitt culture medium containing
  GBS C388/90 (Ia/c) suspension adjusted to 1.5 x 10^4 cfu;
⁴ intraperitoneal challenge with 1 mL Todd-Hewitt culture medium containing
  GBS C388/90 (Ia/c) suspension adjusted to 2.1 x 10^6 cfu;
⁵ not done;
⁶ intraperitoneal challenge with 1 mL Todd-Hewitt culture medium containing
  GBS NCS246 (II/R) suspension adjusted to 1.2 x 10^5 cfu.

In another experiment, one group of 12 mice corresponding
to a control group was injected with PBS, while a second
group of 12 mice was immunized with formaldehyde killed
whole cells of GBS strain C388/90 (Ia/c). Six mice from
each of these two groups were challenged with 2.1 x 10^6 cfu
of the GBS strain C388/90 (Ia/c) (Table 1). As in the first
challenge experiment, all mice immunized with the GBS
strain C388/90 (Ia/c) survived the homologous challenge.
Only two out of the 6 mice injected with PBS survived the
infection.
The remaining 6 mice in both groups were then used one
week later to verify whether this antigenic preparation
could confer cross protection against strain NCS246 (II/R),
which produces a serologically distinct capsule. None of
the mice infected with this second GBS strain survived the
infection. The latter result suggested that most of the
protective immune response induced by formaldehyde killed
strain C388/90 is directed against the capsular
polysaccharide and that it could be restricted to strains
of that particular serotype. These results clearly
indicated that this particular model of infection can be
efficiently used to study the protection conferred by
vaccination.
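
The survival percentages reported in Table 1 follow directly from the counts of living mice. A minimal sketch of that arithmetic is given below; the data layout and the helper name `percent_survival` are illustrative assumptions and are not part of the protocol described above.

```python
# Counts (alive, challenged) transcribed from Table 1.
# The dictionary layout and helper name are illustrative assumptions.
TABLE_1 = {
    ("1st infection", "killed C388/90 cells", "homologous"): (8, 8),
    ("1st infection", "control PBS", "homologous"): (4, 8),
    ("2nd infection", "killed C388/90 cells", "homologous"): (6, 6),
    ("2nd infection", "control PBS", "homologous"): (2, 6),
    ("2nd infection", "killed C388/90 cells", "heterologous"): (0, 6),
    ("2nd infection", "control PBS", "heterologous"): (0, 6),
}


def percent_survival(alive, challenged):
    """Return survival as a whole-number percentage."""
    return round(100 * alive / challenged)


for (infection, group, challenge), (alive, challenged) in TABLE_1.items():
    pct = percent_survival(alive, challenged)
    print(f"{infection:14s} {group:22s} {challenge:13s} {alive}/{challenged} ({pct})")
```

Keeping the raw counts next to the percentages makes the small group sizes visible, which matters when comparing the homologous and heterologous challenges.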

EXAMPLE 3 Immunization of rabbit with formaldehyde killed
whole GBS cells and passive protection in mice

A New Zealand rabbit (2.5 kg, Charles River, St-Constant,
Quebec, Canada) was immunized with formaldehyde killed
cells of GBS strain C388/90 (Ia/c) to obtain hyperimmune
serum. This rabbit was injected subcutaneously three
times at three-week intervals with approximately
1.5 x 10^9 cfu of formaldehyde killed whole cells of GBS
strain C388/90 (Ia/c). Freund's complete adjuvant (Gibco BRL
Life Technologies, Grand Island, New York) was used as the
adjuvant for the first immunization, while Freund's
incomplete adjuvant (Gibco BRL) was used for the following
two injections. Serum samples were obtained before the
beginning of the immunization protocol and two weeks after
the last injection. The sera were frozen at -20°C.

The ability of this particular rabbit hyperimmune serum to
passively protect mice against a lethal infection with GBS
was also evaluated. Intraperitoneal injection of mice with
either 15 or 25 µL of hyperimmune rabbit serum 18 hours
before the challenge protected 4 out of 5 mice (80%) against
the infection. Comparatively, survival rates lower than 20%
were recorded for mice in the control group injected with
PBS or serum obtained from a rabbit immunized with
meningococcal outer membrane preparation. This result
clearly indicates that the immunization of another animal
species with killed GBS cells can induce the production of
antibodies that can passively protect mice. This reagent
will also be used to characterize clones.

Table 2   Passive protection of CD-1 mice conferred by
          rabbit serum obtained after immunization with
          formaldehyde killed group B whole streptococci
          (strain C388/90 (Ia/c)) antigenic preparation

groups                       number of living mice 14 days     % survival
                             after the bacterial challenge
                             with GBS strain C388/90 (Ia/c)²
--------------------------------------------------------------------------
rabbit hyperimmune serum¹    4/5                               80
  - 25 µL
rabbit hyperimmune serum¹    4/5                               80
  - 15 µL
control rabbit serum         [illegible]                       20
  - 25 µL
control PBS                  1/10                              10

¹ Freund's complete adjuvant was used for the first
  immunization, and Freund's incomplete adjuvant for the
  following two injections;
² intraperitoneal challenge with 1 ml Todd-Hewitt culture
  medium containing GBS C388/90 (Ia/c) suspension adjusted
  to 2 x 10^4 cfu.

EXAMPLE 4 Recombinant production of His.Tag-GBS fusion
protein

The coding region of a GBS gene was amplified by PCR (DNA
Thermal Cycler GeneAmp PCR system 2400, Perkin Elmer, San
Jose, CA) from the genomic DNA of GBS strain C388/90 (Ia/c)
using oligos that contained base extensions for the
addition of the restriction sites BglII (AGATCT) and HindIII
(AAGCTT), respectively. The PCR product was purified from
agarose gel using a Qiaex II gel extraction kit from Qiagen
(Chatsworth, CA), digested with the restriction enzymes
BglII and HindIII (Pharmacia Canada Inc., Baie d'Urfe,
Canada), and extracted with phenol:chloroform before ethanol
precipitation. The pET-32b(+) vector (Novagen, Madison, WI)
containing the thioredoxin-His.Tag sequence was digested
with the restriction enzymes BglII and HindIII, extracted
with phenol:chloroform, and then ethanol precipitated. The
BglII-HindIII genomic DNA fragment was ligated to the BglII-
HindIII pET-32b(+) vector to create the coding sequence for
the thioredoxin-His.Tag-GBS fusion protein, whose gene was
under control of the T7 promoter. The ligated products were
transformed into E. coli strain XL1 Blue MRF' (Δ(mcrA)183
Δ(mcrCB-hsdSMR-mrr)173 endA1 supE44 thi-1 recA1 gyrA96 relA1
lac [F' proAB lacIqZΔM15 Tn10 (Tetr)]c) (Stratagene, La
Jolla, CA) according to the method of Simanis (Hanahan, D.
DNA Cloning, 1985, D.M. Glover (ed.), pp. 109-135). The
recombinant pET plasmid was purified using a Qiagen kit
(Qiagen, Chatsworth, CA) and the nucleotide sequence of the
DNA insert was verified by DNA sequencing (Taq Dye Deoxy
Terminator Cycle Sequencing kit, ABI, Foster City, CA). The
recombinant pET plasmid was transformed by electroporation
(Gene Pulser II apparatus, BIO-RAD Labs, Mississauga,
Canada) into E. coli strain AD494 (DE3) (Δara-leu7697
ΔlacX74 ΔphoA PvuII phoR ΔmalF3 F'[lac+(lacIq) pro]
trxB::Kan (DE3)) (Novagen, Madison, WI). In this strain of
E. coli, the T7 promoter controlling expression of the
fusion protein is specifically recognized by the T7 RNA
polymerase (present on the λDE3 prophage) whose gene is
under the control of the lac promoter, which is inducible by
isopropyl-β-D-thio-galactopyranoside (IPTG).
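
The primer design step described above (oligos carrying base extensions for BglII and HindIII sites) can be sketched as follows. This is a minimal illustration under stated assumptions: the coding sequence `TOY_CODING_SEQ` is a made-up placeholder and not the GBS gene, the 18-base annealing length and the two-base `GC` clamp are common conventions that the text does not specify, and reading-frame adjustment for the fusion is left out.

```python
# Recognition sequences named in the text.
BGLII = "AGATCT"
HINDIII = "AAGCTT"

# Hypothetical placeholder sequence; NOT the GBS gene.
TOY_CODING_SEQ = "ATGAAACGTCTGGCTATCGCAGTTCTGGGTACCGCATAA"


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def design_primers(coding_seq, anneal_len=18, clamp="GC"):
    """Build primers with 5' restriction-site extensions.

    anneal_len and clamp are assumed values, not taken from the source.
    """
    forward = clamp + BGLII + coding_seq[:anneal_len]
    reverse = clamp + HINDIII + reverse_complement(coding_seq[-anneal_len:])
    return forward, reverse


def internal_sites(coding_seq):
    """Count cloning sites inside the insert; any hit would cut the insert."""
    return {"BglII": coding_seq.count(BGLII),
            "HindIII": coding_seq.count(HINDIII)}


forward_primer, reverse_primer = design_primers(TOY_CODING_SEQ)
print("forward:", forward_primer)
print("reverse:", reverse_primer)
print("internal sites:", internal_sites(TOY_CODING_SEQ))
```

Checking the insert for internal BglII or HindIII sites before ordering primers avoids a digest that would fragment the PCR product.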

The transformant AD494(DE3)/rpET was grown at 37°C with
agitation at 250 rpm in LB broth (peptone 10 g/L, yeast
extract 5 g/L, NaCl 10 g/L) containing 100 µg of ampicillin
(Sigma-Aldrich Canada Ltd., Oakville, Canada) per mL until
the A600 reached a value of 0.6. In order to induce the
production of the thioredoxin-His.Tag-GBS fusion protein,
the cells were incubated for 2 additional hours in the
presence of IPTG at a final concentration of 1 mM. The
bacterial cells were harvested by centrifugation.

The recombinant fusion protein produced by AD494(DE3)/rpET32
upon IPTG induction for 2 h was partially obtained as
insoluble inclusion bodies, which were purified from
endogenous E. coli proteins by the isolation of insoluble
aggregates (Gerlach, G.F. et al 1992, Infect. Immun.
60:892). Induced cells from a 500 mL culture were
resuspended in 20 mL of 25% sucrose-50 mM Tris-HCl buffer
(pH 8.0) and frozen at -70°C. Lysis of cells in the thawed
suspension was achieved by the addition of 5 mL of a solution
of lysozyme (10 mg/mL) in 250 mM Tris-HCl buffer (pH 8.0)
followed by an incubation of 10 to 15 min on ice, and the
addition of 150 mL of detergent mix (5 parts of 20 mM
Tris-HCl buffer [pH 7.4]-300 mM NaCl-2% deoxycholic acid-2%
Nonidet P-40 and 4 parts of 100 mM Tris-HCl buffer
[pH 8]-50 mM EDTA-2% Triton X-100) followed by 5 min
incubation on ice. Upon sonication, protein aggregates were
harvested by centrifugation for 30 min at 35,000 x g and a
sample of the soluble cellular fraction was kept. The
aggregated proteins were solubilized in 6 M guanidine
hydrochloride. The presence of the fusion protein in both
the soluble and insoluble fractions was shown by Western
blot analysis using the serum of a mouse injected with
formaldehyde killed cells of GBS strain C388/90 (Ia/c) that
survived a bacterial challenge with the corresponding GBS
strain.
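
The detergent mix above is specified as 5 parts of one stock and 4 parts of another. The sketch below works out the resulting concentrations within the mix itself (before it is added to the lysate), assuming that "parts" means parts by volume; the dictionary and function names are illustrative only.

```python
# Stock compositions as stated in the text.
STOCK_A = {  # used at 5 parts
    "Tris-HCl pH 7.4 (mM)": 20,
    "NaCl (mM)": 300,
    "deoxycholic acid (%)": 2,
    "Nonidet P-40 (%)": 2,
}
STOCK_B = {  # used at 4 parts
    "Tris-HCl pH 8 (mM)": 100,
    "EDTA (mM)": 50,
    "Triton X-100 (%)": 2,
}


def mix(parts_a, parts_b):
    """Concentrations after combining the stocks (assumes parts by volume)."""
    total = parts_a + parts_b
    final = {name: conc * parts_a / total for name, conc in STOCK_A.items()}
    final.update({name: conc * parts_b / total for name, conc in STOCK_B.items()})
    return final


DETERGENT_MIX = mix(5, 4)
for name, conc in DETERGENT_MIX.items():
    print(f"{name}: {conc:.2f}")
```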

The purification of the fusion protein from the soluble
fraction of IPTG-induced AD494(DE3)/rpET was done by
affinity chromatography based on the properties of the
His.Tag sequence (6 consecutive histidine residues) to bind
to divalent cations (Ni2+) immobilized on the His.Bind metal
chelation resin (Novagen, Madison, WI). The purification
methods used were those described in the pET System Manual,
6th Edition (Novagen, Madison, WI). Briefly, the pelleted
cells obtained from a 100 mL culture induced with IPTG were
resuspended in 4 mL of Binding buffer (5 mM imidazole-500 mM
NaCl-20 mM Tris-HCl pH 7.9), sonicated, and spun at
39,000 x g for 20 min to remove debris. The supernatant was
filtered (0.45 µm pore size membrane) and deposited on a
column of His.Bind resin equilibrated in Binding buffer. The
column was then washed with 10 column volumes of Binding
buffer followed by 6 column volumes of Wash buffer (20 mM
imidazole-500 mM NaCl-20 mM Tris-HCl pH 7.9). The
thioredoxin-His.Tag-GBS fusion protein was eluted with Elute
buffer (1 M imidazole-500 mM NaCl-20 mM Tris-HCl pH 7.9).
The removal of the salt and imidazole from the sample was
done by dialysis against 3 x 1 liter of PBS at 4°C.

The quantities of fusion protein obtained from either the
soluble or insoluble cytoplasmic fractions of E. coli were
estimated by Coomassie staining of a sodium dodecyl sulfate
(SDS)-polyacrylamide gel with serial dilutions of these
proteins and a bovine serum albumin standard (Pierce
Chemical Co., Rockford, Ill.).

EXAMPLE 5 Recombinant production of GBS protein under
control of lambda PL promoter

The DNA coding region of a GBS protein was inserted
downstream of the promoter λPL into the translation vector
pURV22. This plasmid was derived from p629 (George et al,
1987, Bio/Technology 5:600) from which the coding region for
a portion of the herpes simplex virus type I (HSV-I)
glycoprotein (gD-1) was removed and the ampicillin
resistance gene replaced by a kanamycin cassette obtained
from the plasmid vector pUC4K (Pharmacia Biotech Canada
Inc., Baie D'Urfe, Canada). The vector contained a cassette
of the bacteriophage λ cI857 temperature sensitive repressor
gene from which the functional PR promoter had been deleted.
The inactivation of the cI857 repressor by temperature
increase from the range of 30-37°C to 37-42°C resulted in
the induction of the gene under the control of λPL. The
translation of the gene was controlled by the ribosome
binding site cro followed downstream by a BglII restriction
site (AGATCT) and the ATG: ACTAAGGAGGTTAGATCTATG.
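
The layout of the translation-initiation region quoted above can be checked by locating its elements in the 21-base sequence. The sketch below is illustrative: treating AGGAGG as the Shine-Dalgarno core of the cro ribosome binding site is an annotation assumed here, not a statement from the text, and the function name is hypothetical.

```python
# Sequence quoted in the text: RBS, BglII site, then the ATG.
RBS_REGION = "ACTAAGGAGGTTAGATCTATG"

SHINE_DALGARNO = "AGGAGG"  # assumed annotation of the RBS core
BGLII = "AGATCT"
START_CODON = "ATG"


def annotate(region):
    """Return 0-based start positions of each element and the SD-to-ATG spacer."""
    sd = region.find(SHINE_DALGARNO)
    site = region.find(BGLII)
    start = region.find(START_CODON)
    spacer = region[sd + len(SHINE_DALGARNO):start]
    return {"sd": sd, "bglii": site, "atg": start, "spacer": spacer}


FEATURES = annotate(RBS_REGION)
print(FEATURES)
```

The BglII site sits immediately upstream of the ATG, so an insert cloned through BglII lands directly behind the ribosome binding site.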

Restriction enzymes and T4 DNA ligase were used according to
suppliers (Pharmacia Biotech Canada Inc., Baie D'Urfe,
Canada; and New England Biolabs Ltd., Mississauga, Canada).
Agarose gel electrophoresis of DNA fragments was performed
as described by Sambrook et al. (Molecular Cloning: A
Laboratory Manual, 1989, Cold Spring Harbor Laboratory
Press, N.Y.). Chromosomal DNA of the GBS bacteria was
prepared according to procedures described in Jayarao et al
(J. Clin. Microbiol., 1991, 29:2774). DNA amplification
reactions by polymerase chain reaction (PCR) were made using
DNA Thermal Cycler GeneAmp PCR system 2400 (Perkin Elmer,
San Jose, CA). Plasmids used for DNA sequencing were
purified using plasmid kits from Qiagen (Chatsworth, CA).
DNA fragments were purified from agarose gels using Qiaex II
gel extraction kits from Qiagen (Chatsworth, CA). Plasmid
transformations were carried out by the method described by
Hanahan (DNA Cloning, Glover (ed.), pp. 109-135, 1985). The
sequencing of genomic DNA inserts in plasmids was done using
synthetic oligonucleotides which were synthesized by
oligonucleotide synthesizer model 394 (the Perkin-Elmer
Corp., Applied Biosystems Div. (ABI), Foster City, CA). The
sequencing reactions were carried out by PCR using the Taq
Dye Deoxy Terminator Cycle Sequencing kit (ABI, Foster City,
CA) and DNA electrophoresis was performed on automated DNA
sequencer 373A (ABI, Foster City, CA). The assembly of the
DNA sequence was performed using the program Sequencher 3.0
(Gene Codes Corporation, Ann Arbor, MI). Analysis of the DNA
sequences and their predicted polypeptides was performed
with the program Gene Works version 2.45 (Intelligenetics,
Inc., Mountain View, CA).



The coding region of the GBS gene was amplified by PCR from
GBS strain C388/90 (Ia/c) genomic DNA using oligos that
contained base extensions for the addition of restriction
sites BglII (AGATCT) and XbaI (TCTAGA), respectively. The PCR
product was purified from agarose gel using a Qiaex II gel
extraction kit from Qiagen (Chatsworth, CA), digested with
the restriction enzymes BglII and XbaI, and extracted with
phenol:chloroform before ethanol precipitation. The pURV22
vector was digested with the restriction enzymes BglII and
XbaI, extracted with phenol:chloroform, and ethanol
precipitated. The BglII-XbaI genomic DNA fragment was
ligated to the BglII-XbaI pURV22 vector in which the GBS
gene was under the control of the λPL promoter. The ligated
products were transformed into E. coli strain XL1 Blue MRF'
(Δ(mcrA)183 Δ(mcrCB-hsdSMR-mrr)173 endA1 supE44 thi-1 recA1
gyrA96 relA1 lac [F' proAB lacIqZΔM15 Tn10 (Tetr)]c)
(Stratagene, La Jolla, CA) according to the methods described
in Hanahan, supra. Transformants harboring plasmids with the
insert were identified by analysis of lysed cells submitted
to electrophoresis on agarose gel (Sambrook et al, supra).
The recombinant pURV22 plasmid was purified using a Qiagen
kit (Qiagen, Chatsworth, CA) and the nucleotide sequence of
the DNA insert was verified by DNA sequencing.

The transformant XL1 Blue MRF'/rpURV22 was grown at 34°C
with agitation at 250 rpm in LB broth containing 50 µg of
kanamycin per mL until the A600 reached a value of 0.6. In
order to induce the production of the fusion protein, the
cells were incubated for 4 additional hours at 39°C. The
bacterial cells were harvested by centrifugation,
resuspended in sample buffer, boiled for 10 min and kept at
-20°C.


EXAMPLE 6 Subcloning GBS protein gene in CMV plasmid pCMV-GH

The DNA coding region of a GBS protein was inserted in phase
downstream of the human growth hormone (hGH) gene, which was
under the transcriptional control of the cytomegalovirus
(CMV) promoter in the plasmid vector pCMV-GH (Tang et al,
Nature, 1992, 356:152). The CMV promoter is non-functional
in E. coli cells but active upon administration of the
plasmid in eukaryotic cells. The vector also incorporated
the ampicillin resistance gene.

The coding region of the gene was amplified by PCR from
genomic DNA of GBS strain C388/90 (Ia/c) using oligos
that contained base extensions for the addition of the
restriction sites BglII (AGATCT) and HindIII (AAGCTT). The
PCR product was purified from agarose gel using a Qiaex II
gel extraction kit from Qiagen (Chatsworth, CA), digested
with the restriction enzymes BglII and HindIII, and
extracted with phenol:chloroform before ethanol
precipitation. The pCMV-GH vector (Laboratory of Dr. Stephen
A. Johnston, Department of Biochemistry, The University of
Texas, Dallas, Texas) containing the human growth hormone to
create fusion proteins was digested with the restriction
enzymes BamHI and HindIII, extracted with phenol:chloroform,
and ethanol precipitated. The 1.3-kb BglII-HindIII genomic
DNA fragment was ligated to the BamHI-HindIII pCMV-GH
vector to create the hGH-GBS fusion protein under the
control of the CMV promoter. The ligated products were
transformed into E. coli strain DH5α [φ80 lacZΔM15 endA1
recA1 hsdR17 (rK-mK+) supE44 thi-1 λ- gyrA96 relA1 Δ(lacZYA-
argF)U169] (Gibco BRL, Gaithersburg, MD) according to the
methods described by Hanahan, supra. Transformants
harboring plasmids with the insert were identified by
analysis of lysed cells submitted to electrophoresis on
agarose gel (Sambrook, J. et al, supra). The recombinant
pCMV plasmid was purified using a Qiagen kit (Qiagen,
Chatsworth, CA) and the nucleotide sequence of the DNA
insert was verified by DNA sequencing.
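
The ligation above joins a BglII-cut insert end to a BamHI-cut vector end. This works because both enzymes leave the same 5' GATC overhang, and the resulting junction is no longer a site for either enzyme. The sketch below illustrates this using the standard cut positions (one base into the top strand of each palindromic site); the function names are illustrative assumptions.

```python
# (recognition site, top-strand cut offset) for each enzyme.
ENZYMES = {
    "BglII": ("AGATCT", 1),    # A^GATCT
    "BamHI": ("GGATCC", 1),    # G^GATCC
    "HindIII": ("AAGCTT", 1),  # A^AGCTT
}


def overhang(enzyme):
    """Single-stranded 5' overhang left after cutting a palindromic site."""
    site, cut = ENZYMES[enzyme]
    return site[cut:len(site) - cut]


def compatible(enzyme_a, enzyme_b):
    """Two ends can anneal when their overhangs are identical."""
    return overhang(enzyme_a) == overhang(enzyme_b)


def ligated_junction(left_enzyme, right_enzyme):
    """Top-strand sequence formed by joining a left end to a right end."""
    left_site, left_cut = ENZYMES[left_enzyme]
    right_site, right_cut = ENZYMES[right_enzyme]
    return left_site[:left_cut] + right_site[right_cut:]


# Vector end cut with BamHI joined to insert end cut with BglII.
JUNCTION = ligated_junction("BamHI", "BglII")
print(compatible("BglII", "BamHI"), JUNCTION)
```

Because the hybrid junction matches neither recognition sequence, neither BamHI nor BglII can later excise the insert at that end.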



EXAMPLE 7 Immunological activity of GBS protein to GBS
challenge

Four groups of 12 female CD-1 mice (Charles River, St-
Constant, Quebec, Canada) of 6 to 8 weeks were injected
subcutaneously three times at three-week intervals with
0.1 mL of the following antigenic preparations: formaldehyde
killed cells of GBS strain C388/90 (~6 x 10^7 cfu), 20 µg of
thioredoxin-His.Tag-GBS fusion protein obtained from the
insoluble fraction (inclusion bodies), 20 µg of the fusion
protein, affinity purified (nickel column), from the soluble
cytoplasmic fraction in E. coli, or 20 µg of affinity
purified (nickel column) thioredoxin-His.Tag control
polypeptide. 20 µg of QuilA™ (Cedarlane Laboratories Ltd,
Hornby, Canada) was added to each antigenic preparation as
the adjuvant. Serum samples were obtained from each mouse
before immunization (PB) and on days 20 (TB1), 41 (TB2) and
54 (TB3) during the immunization protocols. Sera were frozen
at -20°C.

An increase of the ELISA titers was recorded after each
injection of the fusion protein, indicating a good primary
response and a boost of the specific humoral immune response
after each of the second and third administrations. At the
end of the immunization period, the mean reciprocal
ELISA titer was 456,145 for the group immunized with 20 µg
of fusion protein obtained from inclusion bodies compared to
290,133 for the group of mice immunized with the protein
from the soluble fraction in E. coli. The latter result
suggests that the protein obtained from inclusion bodies
could be more immunogenic than the soluble protein. Analysis
of mice sera in ELISA using the affinity purified
thioredoxin-His.Tag to coat plates showed that negligible
antibody titers are made against the thioredoxin-His.Tag
portion of the fusion protein. The reactivity of the sera
from mice injected with the recombinant fusion protein was
also tested by ELISA against formaldehyde killed whole cells
of GBS strain C388/90. The antibodies induced by
immunization with recombinant fusion protein also recognized
their specific epitopes on GBS cells, indicating that their
conformation is close enough to the native streptococcal
protein to induce cross-reactive antibodies.

To verify whether the immune response induced by
immunization could protect against GBS infection, mice were
challenged with 3.5 x 10^5 cfu of GBS strain C388/90 (Ia/c)
and 1.2 x 10^5 cfu of strain NCS246 (II/R), the results of
which are illustrated in Tables 3 and 4 respectively. Mice
immunized with control thioredoxin-His.Tag peptide were not
protected against challenge with either GBS strain, while
those immunized with formaldehyde killed C388/90 whole cells
were protected only against homologous challenge. The
thioredoxin-His.Tag-GBS fusion protein of the invention
protected mice from challenge with both GBS strains. Blood
and spleen culture of these mice did not reveal the presence
of any GBS.

Table 3   Survival from GBS strain C388/90 (Ia/c) challenge¹

immunizing agent                      no. mice surviving   % survival
                                      challenge
----------------------------------------------------------------------
thioredoxin-His.Tag²                  1/6                  17
formaldehyde killed C388/90 cells³    6/6                  100
thioredoxin-His.Tag-GBS fusion        6/6                  100
  (inclusion body preparation)⁴
thioredoxin-His.Tag-GBS fusion        6/6                  100
  (cytoplasmic fraction)⁴

¹ intraperitoneal administration with 1 ml Todd-Hewitt
  culture medium adjusted to 3.5 x 10^5 cfu;
² 20 µg administered; posterior legs paralyzed in surviving
  mouse; GBS detected in blood and spleen;
³ 6 x 10^7 cfu administered;
⁴ 20 µg administered.

Table 4   Survival from GBS strain NCS246 (II/R) challenge¹

immunizing agent                      no. mice surviving   % survival
                                      challenge
----------------------------------------------------------------------
thioredoxin-His.Tag²                  0/6                  0
formaldehyde killed C388/90 cells³    2/6                  34
thioredoxin-His.Tag-GBS fusion        5/5⁴                 100
  (inclusion body preparation)²
thioredoxin-His.Tag-GBS fusion        6/6                  100
  (cytoplasmic fraction)²

¹ intraperitoneal administration with 1 ml Todd-Hewitt
  culture medium containing GBS NCS246 (II/R) suspension
  adjusted to 1.2 x 10^5 cfu;
² 20 µg administered;
³ 6 x 10^7 cfu administered;
⁴ one mouse died during immunization.



EXAMPLE 8 Immunization with recombinant GBS protein confers
protection against experimental GBS infection

This example illustrates the protection of mice against
fatal GBS infection by immunization with the recombinant
protein corresponding to the SEQ ID NO: 39.

Groups of 10 female CD-1 mice (Charles River) were immunized
subcutaneously three times at three-week intervals with
20 µg of recombinant protein purified from E. coli strain BLR
(Novagen) harboring the recombinant pURV22 plasmid vector
containing the GBS gene corresponding to SEQ ID NO: 42 in
presence of 20 µg of QuilA™ adjuvant (Cedarlane
Laboratories Ltd, Hornby, Canada) or, as control, with
QuilA™ adjuvant alone in PBS. Blood samples were collected
from the orbital sinus on days 1, 22 and 43 prior to each
immunization and fourteen days (day 57) following the third
injection. One week later the mice were challenged with
approximately 10^4 to 10^6 CFU of various virulent GBS
strains. Samples of the GBS challenge inoculum were plated
on TSA/5% sheep blood agar plates to determine the CFU and
to verify the challenge dose. Deaths were recorded for a
period of 14 days and on day 14 post-challenge, the
surviving mice were sacrificed and blood and spleen were
tested for the presence of GBS organisms. The survival data
are shown in Table 5.

Prechallenge sera were analyzed for the presence of
antibodies reactive with GBS by standard immunoassays. ELISA
and immunoblot analyses indicated that immunization with
recombinant GBS protein produced in E. coli elicited
antibodies reactive with both recombinant and native GBS
protein. Antibody responses to GBS are described in Example
9.

Table 5. Ability of recombinant GBS protein corresponding
to SEQ ID NO: 39 to elicit protection against 8 diverse GBS
challenge strains

                 Challenge strain
Immunogen        Designation   Type    No. alive : No. dead¹
--------------------------------------------------------------
rGBS protein     C388/90       Ia/c    8 : 2  (P<0.0001)
none                                   0 : 10

rGBS protein     NCS 246       II/R    10 : 0 (P=0.0012)
none                                   3 : 7

rGBS protein     ATCC12401     Ib      10 : 0 (P=0.001)
none                                   3 : 7

rGBS protein     NCS 535       V       10 : 0 (P=0.01)
none                                   5 : 5

rGBS protein     NCS 9842      VI      10 : 0 (P<0.0001)
none                                   0 : 10

rGBS protein     NCS 915       III     7 : 3  (P=0.0007)²
NCS 915-F³                             1 : 9
none                                   4 : 6

rGBS protein     NCS 954       III/R   7 : 3  (P=0.002)
NCS 954-F                              4 : 6
none                                   1 : 9

rGBS protein     COH1          III     4 : 6  (P=0.0004)
COH1-F                                 3 : 7
none                                   0 : 10

¹ Groups of 10 mice per group were used; the number of mice
  surviving the infection and the number of dead mice are
  indicated. The survival curves corresponding to recombinant
  GBS protein-immunized animals were compared to the survival
  curves corresponding to mock-immunized animals using the
  log-rank test for nonparametric analysis.
² Comparison analysis to NCS915-F-immunized animals.
³ Animals were immunized with formaldehyde-killed GBS in
  presence of QuilA™ adjuvant.

All hemocultures from surviving mice were negative at day 14
post-challenge. Spleen cultures from surviving mice were
negative except for a few mice from experiment MB-11.
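
Footnote 1 of Table 5 states that survival curves were compared with the log-rank test. A minimal sketch of a two-group log-rank test is given below. The final counts (8 alive : 2 dead versus 0 alive : 10 dead) match the C388/90 row of Table 5, but the individual days of death are hypothetical, since the text reports only the counts at day 14; the P value printed here is therefore not expected to reproduce the one in the table.

```python
import math


def logrank_test(group_a, group_b):
    """Two-sample log-rank test.

    Each group is a list of (day, died) pairs; survivors are entered
    as censored observations, e.g. (14, False).
    Returns (chi-square statistic with 1 df, P value).
    """
    everyone = group_a + group_b
    event_days = sorted({day for day, died in everyone if died})
    obs_minus_exp = 0.0
    variance = 0.0
    for day in event_days:
        risk_a = sum(1 for d, _ in group_a if d >= day)
        risk_b = sum(1 for d, _ in group_b if d >= day)
        dead_a = sum(1 for d, died in group_a if died and d == day)
        dead_b = sum(1 for d, died in group_b if died and d == day)
        n = risk_a + risk_b
        dead = dead_a + dead_b
        obs_minus_exp += dead_a - dead * risk_a / n
        if n > 1:
            variance += (dead * (risk_a / n) * (risk_b / n)
                         * (n - dead) / (n - 1))
    if variance == 0:
        return 0.0, 1.0
    chi2 = obs_minus_exp ** 2 / variance
    # Survival function of chi-square with 1 degree of freedom.
    return chi2, math.erfc(math.sqrt(chi2 / 2))


# Hypothetical days of death; only the alive/dead counts come from Table 5.
IMMUNIZED = [(3, True), (5, True)] + [(14, False)] * 8
CONTROL = [(1, True)] * 3 + [(2, True)] * 4 + [(3, True)] * 3

CHI2, P_VALUE = logrank_test(IMMUNIZED, CONTROL)
print(f"chi-square = {CHI2:.2f}, P = {P_VALUE:.2g}")
```

Unlike a comparison of end-point counts alone, the log-rank test uses the timing of each death, which is why the daily observation schedule matters for the analysis.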

EXAMPLE 9 Vaccination with the recombinant GBS protein
elicits an immune response to GBS

Groups of 10 female CD-1 mice were immunized subcutaneously
with recombinant GBS protein corresponding to SEQ ID NO: 39
as described in Example 8. In order to assess the antibody
response to native GBS protein, sera from blood samples
collected prior to each immunization and fourteen days after
the third immunization were tested for antibody reactive
with GBS cells by ELISA using plates coated with
formaldehyde-killed GBS cells from type III strain NCS 954,
type Ib strain ATCC12401, type V strain NCS 535 or type VI
strain NCS 9842. The specificity of the raised antibodies
for GBS protein was confirmed by Western blot analyses of
GBS cell extracts and purified recombinant antigens. The
results shown in Figure 10 clearly demonstrate that animals
respond strongly to recombinant GBS protein used as
immunogen, with median reciprocal antibody titers varying
between 12000 and 128000, for sera collected after the third
immunization, depending on the coating antigen. All
preimmune sera were negative when tested at a dilution of
1:100. GBS-reactive antibodies were detectable in the sera
of each animal after a single injection of recombinant GBS
protein.

Example 10 Antigenic conservation of the GBS protein of the
present invention

Monoclonal antibodies (MAbs) specific to the GBS protein of
the present invention were used to demonstrate that this
surface antigen is produced by all GBS and that it is also
antigenically highly conserved.

A collection of 68 GBS isolates was used to evaluate the
reactivity of the GBS-specific MAbs. These strains were
obtained from the National Center for Streptococcus,
Provincial Laboratory of Public Health for Northern Alberta,
Canada; Centre Hospitalier Universitaire de Quebec, Pavillon
CHUL, Quebec, Canada; American Type Culture Collection, USA;
Laboratoire de Santé Publique du Quebec, Canada; and Dept.
of Infectious Disease, Children's Hospital and Medical
Center, Seattle, USA. All eight MAbs were tested against
the following panel of strains: 6 isolates of serotype Ia or
Ia/c, 3 isolates of serotype Ib, 4 isolates of serotype II,
14 isolates of serotype III, 2 isolates of serotype IV, 2
isolates of serotype V, 2 isolates of serotype VI, 2
isolates of serotype VII, 1 isolate of serotype VIII, 10
isolates that were not serotyped and 3 bovine S. agalactiae
strains. MAb 3A2 was also reacted with additional GBS: 9
isolates of serotype Ia/c and 10 isolates of serotype V.
The strains were grown overnight on blood agar plates at
37°C in an atmosphere of 5% CO2. Cultures were stored at
-70°C in heart infusion broth with 20% (v/v) glycerol.

To obtain the GBS protein-specific MAbs, mice were immunized
three times at three-week intervals with 20 µg of purified
recombinant GBS protein (SEQ ID NO: 44) in the presence of
20% QuilA™ adjuvant. Hybridoma cell lines were generated by
fusion of spleen cells recovered from immunized mice with
the nonsecreting SP2/0 myeloma cell line as described
previously (Hamel, J. et al. 1987. J. Med. Microbiol.
23:163-170). Hybrid clone supernatants were tested for
specific antibody production by ELISA using formaldehyde
inactivated GBS and purified recombinant GBS protein (SEQ ID
NO: 39 or 44) as coating antigen, as previously described
(Hamel, J. et al. 1987. J. Med. Microbiol. 23:163-170).
Specific hybrids were cloned by limiting dilutions, expanded,
and frozen in liquid nitrogen. Production of recombinant GBS
protein was presented in Examples 4 & 5. Purified
recombinant GBS protein or formaldehyde inactivated GBS were
resolved by electrophoresis using the discontinuous
buffer system of Laemmli as recommended by the manufacturer
and then transferred onto nitrocellulose membrane for Western
immunoblotting as described previously (Martin et al. 1992.
Infect. Immun. 60:2718-2725).

Western immunoblotting experiments clearly indicated that
all eight MAbs recognized a protein band that corresponded
to the purified recombinant GBS protein (SEQ ID NO: 39).
These MAbs also reacted with a protein band present in every
GBS isolate tested so far. The reactivity of these GBS-
specific MAbs is presented in Table 6. Each MAb reacted
well with all 46 GBS. In addition, these MAbs also
recognized the 3 S. agalactiae strains of bovine origin that
were tested. MAb 3A2 also recognized nineteen GBS: 9
isolates of serotype Ia/c and 10 of serotype V. The other
MAbs were not tested against these additional strains.

These results demonstrated that the GBS protein (SEQ ID
NO: 39) was produced by all 65 GBS and the 3 S.
agalactiae strains of bovine origin that were tested so far.
More importantly, these results clearly demonstrated that
the epitopes recognized by these eight GBS-specific MAbs
were widely distributed and conserved among GBS. These
results also indicated that these epitopes were not
restricted to serologically related isolates since
representatives of all known GBS serotypes including the
major disease causing groups were tested.

In conclusion, the data presented in this example clearly
demonstrated that the GBS protein of the present invention
is produced by all GBS and that it is antigenically highly
conserved.






WO 99/42588 



PCT/CA99/00114 










WE CLAIM: 

1. An isolated polynucleotide encoding a polypeptide 
having at least 70% identity to a second polypeptide 
having a sequence selected from the group consisting 
of: 

SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5,
SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10,
SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 15,
SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19,
SEQ ID NO: 20, SEQ ID NO: 21, SEQ ID NO: 23, SEQ ID NO: 24,
SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 29,
SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 33, SEQ ID NO: 34,
SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 38, SEQ ID NO: 39,
SEQ ID NO: 40, SEQ ID NO: 41 and SEQ ID NO: 44 or
fragments, analogs or derivatives thereof.

2. A polynucleotide according to claim 1, wherein said 
polynucleotide encodes a polypeptide having at least 
95% identity to the second polypeptide. 
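
The identity thresholds recited in claims 1 and 2 (at least 70% and at least 95%) can be illustrated with a short calculation. The Python sketch below is only an illustration under stated assumptions and is not a method defined in this document: it assumes the two sequences have already been aligned to equal length with "-" as the gap character, and it counts identity over all alignment columns. Other conventions (for example, dividing by the length of the shorter sequence) give different values. The function name `percent_identity` and the variant sequence are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of alignment columns holding the same residue in both sequences.

    Assumes the inputs are pre-aligned (equal length, '-' marks a gap).
    A column containing a gap never counts as a match.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("sequences must not be empty")
    matches = sum(
        1 for x, y in zip(aligned_a, aligned_b) if x == y and x != "-"
    )
    return 100.0 * matches / len(aligned_a)


# First ten residues of SEQ ID NO: 3 compared with a hypothetical
# variant carrying a single substitution (I -> V at position 7).
reference = "MSTLTIIIAT"
variant = "MSTLTIVIAT"
identity = percent_identity(reference, variant)
print(f"{identity:.1f}% identity")
print("meets the 70% threshold:", identity >= 70.0)
print("meets the 95% threshold:", identity >= 95.0)
```

In practice the alignment itself would come from a dedicated alignment program; the sketch covers only the final counting step.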

3. An isolated polynucleotide encoding a polypeptide
capable of generating antibodies having binding
specificity for a polypeptide having a sequence
selected from the group consisting of:

SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5,
SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10,
SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 15,
SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19,
SEQ ID NO: 20, SEQ ID NO: 21, SEQ ID NO: 23, SEQ ID NO: 24,
SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 29,
SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 33, SEQ ID NO: 34,
SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 38, SEQ ID NO: 39,
SEQ ID NO: 40, SEQ ID NO: 41 and SEQ ID NO: 44 or
fragments, analogs or derivatives thereof.

4. An isolated polynucleotide that is complementary to the
polynucleotide of claim 1.

5. An isolated polynucleotide that is complementary to the
polynucleotide of claim 3.

6. The polynucleotide of claim 1, wherein said
polynucleotide is DNA.

7. The polynucleotide of claim 3, wherein said
polynucleotide is DNA.

8. The polynucleotide of claim 1, wherein said
polynucleotide is RNA.

9. The polynucleotide of claim 3, wherein said
polynucleotide is RNA.

10. A polynucleotide which hybridizes under stringent
conditions to a second polynucleotide having a sequence
selected from the group consisting of:

SEQ ID NO: 1, SEQ ID NO: 7, SEQ ID NO: 13, SEQ ID
NO: 22, SEQ ID NO: 27, SEQ ID NO: 32, SEQ ID NO:
37, SEQ ID NO: 42 and SEQ ID NO: 43 or fragments,
analogues or derivatives thereof.

11. A polynucleotide which hybridizes under stringent
conditions to a second polynucleotide having a sequence
selected from the group consisting of:

SEQ ID NO: 37, SEQ ID NO: 42 and SEQ ID NO: 43.

12. A polynucleotide according to claim 11 which hybridizes
under stringent conditions to a second polynucleotide
having the sequence SEQ ID NO: 37.

13. A polynucleotide according to claim 11 which hybridizes
under stringent conditions to a second polynucleotide
having the sequence SEQ ID NO: 42.

14. A polynucleotide according to claim 11 which hybridizes
under stringent conditions to a second polynucleotide
having the sequence SEQ ID NO: 43.

15. A polynucleotide according to claim 10 wherein said 
polynucleotide has at least 95% complementarity to the 
second polynucleotide. 

16. A polynucleotide according to claim 11 wherein said 
polynucleotide has at least 95% complementarity to the 
second polynucleotide. 
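
The "at least 95% complementarity" recited in claims 15 and 16 can be pictured as a base-pair count between two antiparallel strands. The Python sketch below is a simplified illustration, not a procedure taken from this document: it assumes DNA strands of equal length, no gaps, and Watson-Crick pairing only, and it says nothing about whether hybridization would actually occur under stringent conditions. The names `reverse_complement` and `percent_complementarity` are hypothetical helpers.

```python
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(dna: str) -> str:
    """Return the strand that pairs perfectly with `dna`, written 5' to 3'."""
    return "".join(COMPLEMENT[base] for base in reversed(dna.upper()))


def percent_complementarity(probe: str, target: str) -> float:
    """Percent of positions at which `probe` base-pairs with `target`.

    Both strands are written 5' to 3', so the probe is compared with the
    reverse complement of the target. Assumes equal lengths and no gaps.
    """
    if len(probe) != len(target):
        raise ValueError("strands must have equal length")
    if not probe:
        raise ValueError("strands must not be empty")
    perfect_partner = reverse_complement(target)
    paired = sum(1 for p, q in zip(probe.upper(), perfect_partner) if p == q)
    return 100.0 * paired / len(probe)


# First 20 nucleotides of SEQ ID NO: 1 as the target strand.
target = "TATCTGGCAAAGAGCCAGCT"
probe = reverse_complement(target)
print(probe, percent_complementarity(probe, target))
```

A probe with a single mismatch over these 20 positions would score 95% by this count, which is the boundary value named in the claims.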

17. A vector comprising the polynucleotide of claim 1, 
wherein said polynucleotide is operably linked to an 
expression control region. 

18. A vector comprising the polynucleotide of claim 3, 
wherein said polynucleotide is operably linked to an 
expression control region. 

19. A host cell transfected with the vector of claim 17. 

20. A host cell transfected with the vector of claim 18. 

21. A process for producing a polypeptide comprising 
culturing a host cell according to claim 19 under 
conditions suitable for expression of said polypeptide. 

22. A process for producing a polypeptide comprising
culturing a host cell according to claim 20 under
conditions suitable for expression of said polypeptide.






23. An isolated polypeptide having at least 70% identity to 
a second polypeptide having a sequence selected from 
the group consisting of: 

SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5,
SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10,
SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 15,
SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19,
SEQ ID NO: 20, SEQ ID NO: 21, SEQ ID NO: 23, SEQ ID NO: 24,
SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 29,
SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 33, SEQ ID NO: 34,
SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 38, SEQ ID NO: 39,
SEQ ID NO: 40, SEQ ID NO: 41 and SEQ ID NO: 44 or
fragments, analogs or derivatives thereof.

24. The isolated polypeptide of claim 23 having a sequence
according to SEQ ID NO: 39.

25. The isolated polypeptide of claim 23 having a sequence
according to SEQ ID NO: 44.

26. An isolated polypeptide capable of generating
antibodies having binding specificity for a second
polypeptide having a sequence selected from the group
consisting of:

SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5,
SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10,
SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 15,
SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19,
SEQ ID NO: 20, SEQ ID NO: 21, SEQ ID NO: 23, SEQ ID NO: 24,
SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 29,
SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 33, SEQ ID NO: 34,
SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 38, SEQ ID NO: 39,
SEQ ID NO: 40, SEQ ID NO: 41 and SEQ ID NO: 44 or
fragments, analogs or derivatives thereof.






27. The isolated polypeptide of claim 26 having a sequence
according to SEQ ID NO: 39.

28. The isolated polypeptide of claim 26 having a sequence
according to SEQ ID NO: 44.

29. An isolated polypeptide having an amino acid sequence
selected from the group consisting of:

SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5,
SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10,
SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 15,
SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19,
SEQ ID NO: 20, SEQ ID NO: 21, SEQ ID NO: 23, SEQ ID NO: 24,
SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 29,
SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 33, SEQ ID NO: 34,
SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 38, SEQ ID NO: 39,
SEQ ID NO: 40 and SEQ ID NO: 41 or fragments, analogs or
derivatives thereof.

30. The isolated polypeptide of claim 29 having an amino
acid sequence according to SEQ ID NO: 39.

31. An isolated polypeptide having an amino acid sequence
according to SEQ ID NO: 44.

32. An isolated polypeptide according to any one of claims
29 to 31, wherein the N-terminal Met residue is
deleted.

33. An isolated polypeptide according to any one of claims
29 to 30, wherein the secretory amino acid sequence is
deleted.

34. A vaccine composition comprising a polypeptide according
to any one of claims 23 to 31 and a pharmaceutically
acceptable carrier, diluent or adjuvant.

35. A vaccine composition comprising a polypeptide
according to claim 32 and a pharmaceutically acceptable
carrier, diluent or adjuvant.

36. A vaccine composition comprising a polypeptide 
according to claim 33 and a pharmaceutically acceptable 
carrier, diluent or adjuvant. 

37. A method for therapeutic or prophylactic treatment of
streptococcal bacterial infection in an animal
susceptible to streptococcal infection comprising
administering to said animal a therapeutic or
prophylactic amount of a composition according to claim
34.

38. A method for therapeutic or prophylactic treatment of
streptococcal bacterial infection in an animal
susceptible to streptococcal infection comprising
administering to said animal a therapeutic or
prophylactic amount of a composition according to claim
35.

39. A method for therapeutic or prophylactic treatment of
streptococcal bacterial infection in an animal
susceptible to streptococcal infection comprising
administering to said animal a therapeutic or
prophylactic amount of a composition according to claim
36.

40. A method according to any one of claims 37 to 39,
wherein said animal is a bovine.

41. A method according to any one of claims 37 to 39,
wherein said animal is a human.

42. A method according to any one of claims 37 to 39,
wherein said bacterial infection is selected from the
group consisting of group A streptococcus and group B
streptococcus.

43. A method according to claim 42, wherein said bacterial
infection is group B streptococcus.



44. Use of a vaccine composition according to claim 34 for 
the therapeutic or prophylactic treatment of 
streptococcal bacterial infection in an animal 
susceptible to or infected with streptococcal infection 
comprising administering to said animal a therapeutic 
or prophylactic amount of the composition. 



45. Use of a vaccine composition according to any one of
claims 35 to 36 for the therapeutic or prophylactic
treatment of streptococcal bacterial infection in an
animal susceptible to or infected with streptococcal
infection comprising administering to said animal a
therapeutic or prophylactic amount of the composition.



46. Use of a vaccine composition according to any one of
claims 23 to 31 for the manufacture of a vaccine for
the therapeutic or prophylactic treatment of
streptococcal bacterial infection in an animal
susceptible to streptococcal infection comprising
administering to said animal a therapeutic or
prophylactic amount of the composition.

47. Use of a vaccine composition according to claim 32 for
the manufacture of a vaccine for the therapeutic or
prophylactic treatment of streptococcal bacterial
infection in an animal susceptible to streptococcal
infection comprising administering to said animal a
therapeutic or prophylactic amount of the composition.

48. Use of a vaccine composition according to claim 33 for
the manufacture of a vaccine for the therapeutic or
prophylactic treatment of streptococcal bacterial
infection in an animal susceptible to streptococcal
infection comprising administering to said animal a
therapeutic or prophylactic amount of the composition.

TATCTGGCAA AGAGCCAGCT AATCGTTTTA GTTGGGCTAA AAATAAATTA TTAATCAATG 60
SGK EPA NRFS WAK NKL LING 

GATTCATTGC AACTCTAGCA GCAACTATCT TATTTTTTGC AGTTCAATTC ATAGGTCTTA 120 
F I A TLA ATIL F F A VQF IGLK 

AACCAGATTA CCCTGGAAAA ACCTACTTTA TTATCCTATT GACAGCATGG ACTTTGATGG 180
PDY PGK TYFI ILL TAW TLMA

CATTAGTAAC TGCTTTAGTG GGATGGGATA ATAGGTATGG TTCCTTCTTG TCGTTATTAA 240
LVT ALV GWDN RYG SFL SLLI

TATTATTATT CCAGCTTGGT TCAAGCGCAG GAACTTACCC AATAGAATTG AGTCCTAAGT 300
LLF QLG SSAG TYP IEL SPKF

TCTTTCAAAC AATTCAACCA TTTTTACCGA TGACTTACTC TGTTTCAGGA TTAAGAGAGA 360
FQT IQP FLPM TYS VSG LRET

CCATCTCGTT GACGGGAGAC GTTAACCATC AATGGAGAAT GCTAGTAATC TTTTTAGTAT 420
ISL TGD VNHQ WRM LVI FLVS

CATCGATGAT ACTTGCTCTT CTTATTTATC GTAAACAAGA AGATTAATAG AAAGTATCTA 480
SMI LAL LIYR KQE D

GTGATAGACT AACAGTATGA TATGGTATGT CAAAGTATTT AGGAGGAGAA GATATGTCTA 540

M S T 

|— > 

CTTTAACAAT AATTATTGCA ACATTAACTG CTTTGGAACA TTTTTATATT ATGTATTTGG 600 
L T I I I A TLTA LEH FYI MYLE 

AGACGTTAGC CACCCAGTCA AATATGACTG GGAAGATTTT TAGTATGTCT AAAGAAGAGT 660 
TLA TQS NMTG K I F SMS KEEL 

TGTCATATTT ACCCGTTATT AAACTTTTTA AGAATCAAGG TGTATACAAC GGCTTGATTG 720 
SYL PVI KLFK NQG VYN GLIG 

GCCTATTCCT CCTTTATGGG TTATATATTT CACAGAATCA AGAAATTGTA GCTGTTTTTT 780 
LFL LYG LYIS QNQ E I V AVFL 

TAATCAATGT ATTGCTAGTT GCTATTTATG GTGCTTTGAC AGTTGATAAA AAAATCTTAT 840 
I N V LLV AIYG ALT V DK KILL 

TAAAACAGGG TGGTTTACCT ATATTAGCTC TTTTAACATT CTTATTTTAA TACTACTTAG 900 
KQG GLP ILAL LTF LF 

CCGTTCGATT TAGTTGAACG GCTTTTAGTA ATCATTTTTT TCTCATAATA CAGGTAGTTT 960 

AAGTAATTTG TCTTTAAAAA TAGTATAATA TAACTACGAA TTCAAAGAGA GGTGACTTTG 1020 

ATTATGACTG AGAACTGGTT ACATACTAAA GATGGTTCAG ATATTTATTA TCGTGTCGTT 1080
MTE NWL HTK DGSD IYY RVV

|— >

GGTCAAGGTC AACCGATTGT TTTTTTACAT GGCAATAGCT TAAGTAGTCG CTATTTTGAT 1140
GQGQ PIV FLH GNSL SSR YFD

AAGCAAATAG CATATTTTTC TAAGTATTAC CAAGTTATTG TTATGGATAG TAGAGGGCAT 1200
KQIA YFS KYY QVIV MDS RGH

GGCAAAAGTC ATGCAAAGCT AAATACCATT AGTTTCAGGC AAATAGCAGT TGACTTAAAG 1260
GKSH AKL NTI SFRQ IAV DLK

GATATCTTAG TTCATTTAGA GATTGATAAA GTTATATTGG TAGGCCATAG CGATGGTGCC 1320
DILV HLE IDK VILV GHS DGA

AATTTAGCTT TAGTTTTTCA AACGATGTTT CCAGGTATGG TTAGAGGGCT TTTGCTTAAT 1380
NLAL VFQ TMF PGMV RGL LLN

TCAGGGAACC TGACTATTCA TGGTCAGCGA TGGTGGGATA TTCTTTTAGT AAGGATTGCC 1440
SGNL TIH GQR WWDI LLV RIA

TATAAATTCC TTCACTATTT AGGGAAACTC TTTCCGTATA TGAGGCAAAA AGCTCAAGTT 1500 
YKFL HYL GKL FPYM RQK AQV

ATTTCGCTTA TGTTGGAGGA TTTGAAGATT AGTCCAGCTG ATTTACAGCA TGTGTCAACT 1560
ISLM LED LKI SPAD LQH VST

CCTGTAATGG TTTTGGTTGG AAATAAGGAC ATAATTAAGT TAAATCATTC TAAGAAACTT 1620
PVMV LVG NKD IIKL NHS KKL

GCTTCTTATT TTCCAAGGGG GGAGTTTTAT TCTTTAGTTG GCTTTGGGCA TCACATTATT 1680
ASYF PRG EFY SLVG FGH HII

AAGCAAGATT CCCATGTTTT TAATATTATT GCAAAAAAGT TTATCAACGA TACGTTGAAA 1740
KQDS HVF NII AKKF IND TLK

GGAGAAATTG TTGAAAAAGC TAATTGAAAA AGTCAAATCA CTGACTTCTG TGATTAAAAT 1800
GEIV EKA N

TGTATTTTTT ATATCTGTTT TAGTGCTTAT TATTGTTGAA ATGATTCATT TGAAACGAAC 1860

M I H L K R T 



TATTTCTGTT GAGCAACTAA AGAGTGTTTT TGGGCAATTA TCTCCAATGA ATCTTTTCTT 1920 
ISV EQLK SVF GQL SPMN LFL

AATTATCCTT GTGGGGGTTA TCGCTGTCTT ACCGACAACC GGATATGACT TTGTACTGAA 1980
IIL VGVI AVL PTT GYDF VLN

TGGACTTTTA CGTACAGATA AAAGCAAAAG GTATATTTTA CAGACTAGTT GGTGTATCAA 2040
GLLR TDKS KRY ILQ TSWC IN

CACTTTTAAT AACTTGTCAG GATTCGGTGG CTTAATCGAT ATTGGGTTGC GCATGGCTTT 2100
TFNN LSGF GGL IDI GLRM AF

TTATGGTAAA AAAGGTCAAG AGAAGAGTGA CCTAAGAGAA GTGACTCGTT TTTTACCCTA 2160
YGKK GQEK SDL REV TRFL PY

TCTTATTTCT GGTCTGTCAT TTATTAGTGT GATTGCCTTA ATCATGAGCC ATATTTTTCA 2220
LISG LSFI SVI ALI MSHI FH

TGCCAAAGCT AGTGTTGATT ACTATTATTT GGTATTAATT GGTGCTAGTA TGTATTTTCC 2280
AKAS VDYY YLV LIG ASMY FP

TGTTATTTAT TGGATTTCTG GTCATAAAGG AAGCCATTAT TTCGGAGATA TGCCATCTAG 2340
VIYW ISGH KGS HYF GDMP SS

TACTCGTATA AAATTAGGTG TTGTTTCTTT TTTTGAATGG GGATGTGCGG CCGCAGCATT 2400
TRIK LGVV SFF EWG CAAA AF

TATAATTATC GGTTATTTAA TGGGCATTCA TCTACCAGTT TATAAAATTT TACCACTATT 2460
IIIG YLMG IHL PVY KILP LF

TTGTATTGGT TGTGCCGTCG GGATTGTATC CCTTATTCCC GGTGGATTAG GAAGTTTTGA 2520
CIGC AVGI VSL IPG GLGS FE 



ATTAGTTCTA^TTTACAGGGT TTGCTGCCGA GGGACTACC^AAAGAAACTG TGGTTGCATG 2580 

GTTATTACTT^TATCGTTTAG CCTACTATAT TATTCCATTy TTGCAGGTA TCTATTTCTT 2640 

TATCCATTATJTAGGTAGTC AAATAAATCA ACGTTATGAA AATGTCCCGA AAGAGTTAGT 2700 

Hj L 

ATCAACTGTT^TACAAACCA TGGTGAGCCA TTTGATGCGT ATTTTAGGTG CATTCTTAAT 2760 

i ^ G £i F* Xj X 



ATTTTCAACA GCATTTTTTG AAAATATTAC TTATATTATG TGGTTGCAGA AGCTAGGCTT 2820 
t b T A FFEN ITY IMW LQKL GL 

GGACCCATTA CAAGAACAAA TGTTATGGCA GTTTCCAGGT TTATTGCTGG GGGTTTGTTT 2880 
UFLQ EQML WQF PGL LLGV C F 



2940 



TA.TTCTCTTA GCTAGAACTA TTGATCAAAA AGTGAAAAAT GCTTTTCCAA TTGCTATTAT 

RTID QKV KNA F P I a II 

CTGGATTACT TTGACATTGT TTTATCTTAA TTTAGGTCAT ATTAGTTGGC GACTATCTTT 3000 
wiTL TLFY LNL G H I. S W R L SF 

CTGGTTTATT TTACTATTGT TAGGCTTATT AGTCATTAAG CCAACTCTCT ATAAAAAACA 3060 
^ LLV IKP TLYK KQ 

ATTTATTTAT AGCTGGGAAG AGCGTATTAA GGATGGAATC ATTATCGTTA GTTTAATGGG 3120 

^ IKD Gil IVSL MG 

AGTTCTATTT TATATTGCAG GACTACTATT CCCTATCAGG GCTCATATTA CAGGTGGTAG 3180 
LFY IAGL LFP IRA HITG GS 

TATTGAACGC CTGCATTATA TCATAGCATG GGAGCCGATA GCATTGGCTA CGTTGATTCT 324 0 
IERL H Y I I AWE PIA LATL I L 

TACTCTCGTT TATTTATGTT TGGTTAAGAT TTTACAAGGA AAATCTTGTC AGATTGGTGA 

LCLV KIL QGK SCQI GD 

TGTGTTCAAT GTGGATCGTT ATAAAAAACT ACTTCAAGCT TACGGTGGTT CTTCGGATAG 3360 

UKYK KLL QAY GGSS D S 

CGGTTTAGCC^TTTTTAAATG ATAAAAGGCT CTACTGGTAC CAAAAAAATG GAGAAGATTG 3420 



3300 



3480 



CGTTGCGTTC CAATTTGTAA TTGTCAATAA TAAATGTCTT ATTATGGGGG AACCAGCCGG 

FVIV NNK CLI MGEP AG 

TGATGACACT TATATTCGTG AAGCTATTGA ATCGTTTATT GATGATGCTG ATAAGCTAGA 3540 

iREA I E S FID DADK LD 



CTATGACCTT GTTTTTTACA GTATTGGACA GAAGTTGACA CTACTTTTAC ATGAGTATGG 3600 
DLV F YSI GQK LTL LLHE YG 

TTTTGACTTT ATGAAAGTTG GTGAGGATGC TTTAGTTAAT TTAGAAACGT TTACTCTTAA 3660 
tM KVGE DAL VNL ETFT LK 

AGGGAATAAG TACAAACCTT TCAGAAATGC CCTAAATAGA GTTGAAAAGG ATGGTTTCTA 3720
GNKY KPFR N A L NRV EKDG FY 

TTTCGAAGTT GTACAATCGC CACATAGTCA AGAGCTACTA AATAGTTTGG AAGAGATTTC 3780
FEVV QSPH SQE LLN SLEE IS

TAATACTTGG TTAGAAGGAC GTCCTGAAAA AGGTTTCTCA CTAGGATATT TTAATAAAGA 3840
NTWL EGRP EKG FSL GYFN KD

TTATTTCCAA CAAGCCCCAA TAGCTTTGGT AAAAAATGCT GAACACGAAG TTGTTGCTTT 3900
YFQQ APIA LVK NAE HEVV AF

TGCTAATATT ATGCCAAACT ATGAAAAGAG TATTATCTCT ATTGATTTAA TGCGTCACGA 3960
ANIM PNYE KSI ISI DLMR HD

TAAACAGAAA ATTCCGAATG GCGTTATGGA TTTCCTCTTT TTATCATTAT TCTCTTATTA 4020
KQKI PNGV MDF LFL SLFS YY

TCAAGAGAAG GGATACCACT ATTTTGATTT GGGGATGGCA CCTTTATCAG GAGTTGGTCG 4080
QEKG YHYF DLG MAP LSGV GR

CGTTGAAACA AGTTTTGCTA AAGAGAGAAT GGCGTATCTT GTCTATCATT TCGGTAGTCA 4140
VETS FAKE RMA YLV YHFG SH

TTTCTACTCA TTTAATGGTT TACACAAGTA TAAGAAGAAG TTTACACCAT TGTGGTCGGA 4200
FYSF NGLH KYK KKF TPLW SE

ACGTTATATT TCTTGTTCTC GTTCGTCCTG GTTAATTTGT GCTATTTGTG CCCTATTAAT 4260
RYIS CSRS SWL ICA ICAL LM

GGAAGATAGT AAAATTAAGA TTGTTAAATA AGCTTTATTT GGCAATTAAA AAGAGCATGT 4320
EDSK IKIV K

CATGCGACAT GCTCTTTTTA AATCATTTAA TACCATTGAT TGCTTGAATC TACTTTATAA 4380

TATGATGTGC TTTTAAATAT TGTTTAGCTA CTGTAGCTGC TGATTTATGC TTTACAGCTA 4440

CTTGGTAGTT CATTTCTTGC ATTTCTTTTT CAGTGATATG ACCAGCAAGT TTATTGAGAG 4500

CTTTTTTTAC TTGA (SEQ ID NO: 1) 4514

FIG. 1a
[ clone 1-dna/ aa] 
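
FIG. 1a prints each deduced amino acid sequence beneath the nucleotide sequence that encodes it. The Python sketch below shows how such a translation can be reproduced for the first 60 nucleotides of SEQ ID NO: 1. It rests on assumptions that are not stated in this document: the standard genetic code is used, and the reading-frame offset of 2 is inferred here by matching the output against the residues printed under the first line of the figure. The names `CODON_TABLE` and `translate` are hypothetical.

```python
# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str, offset: int = 0) -> str:
    """Translate DNA into one-letter amino acids, '*' marking stop codons.

    Whitespace (such as the 10-base grouping used in the figures) is
    ignored, and a trailing incomplete codon is dropped.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for start in range(offset, len(seq) - 2, 3):
        residues.append(CODON_TABLE[seq[start:start + 3]])
    return "".join(residues)


# First line of FIG. 1a (nucleotides 1 to 60 of SEQ ID NO: 1).
first_line = (
    "TATCTGGCAA AGAGCCAGCT AATCGTTTTA "
    "GTTGGGCTAA AAATAAATTA TTAATCAATG"
)
print(translate(first_line, offset=2))
```

The printed residues can be compared with the start of SEQ ID NO: 2 in FIG. 1b; the final residue of the first ten-block pair there spans the line break in FIG. 1a and is therefore not produced from this line alone.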

SGKEPANRFS WAKNKLLING FIATLAATIL FFAVQFIGLK PDYPGKTYFI 50
ILLTAWTLMA LVTALVGWDN RYGSFLSLLI LLFQLGSSAG TYPIELSPKF 100
FQTIQPFLPM TYSVSGLRET ISLTGDVNHQ WRMLVIFLVS SMILALLIYR 150
KQED (SEQ ID NO: 2) 154



FIG. 1b



MSTLTIIIAT LTALEHFYIM YLETLATQSN MTGKIFSMSK EELSYLPVIK 50 
LFKNQGVYNG LIGLFLLYGL YISQNQEIVA VFLINVLLVA IYGALTVDKK 100 
ILLKQGGLPI LALLTFLF (SEQ ID NO: 3) 118 



FIG. 1c



MTENWLHTKD GSDIYYRVVG QGQPIVFLHG NSLSSRYFDK QIAYFSKYYQ 50
VIVMDSRGHG KSHAKLNTIS FRQIAVDLKD ILVHLEIDKV ILVGHSDGAN 100
LALVFQTMFP GMVRGLLLNS GNLTIHGQRW WDILLVRIAY KFLHYLGKLF 150
PYMRQKAQVI SLMLEDLKIS PADLQHVSTP VMVLVGNKDI IKLNHSKKLA 200
SYFPRGEFYS LVGFGHHIIK QDSHVFNIIA KKFINDTLKG EIVEKAN (SEQ ID NO: 4) 247



FIG. 1d

MIHLKRTISV EQLKSVFGQL SPMNLFLIIL VGVIAVLPTT GYDFVLNGLL 50
RTDKSKRYIL QTSWCINTFN NLSGFGGLID IGLRMAFYGK KGQEKSDLRE 100
VTRFLPYLIS GLSFISVIAL IMSHIFHAKA SVDYYYLVLI GASMYFPVIY 150
WISGHKGSHY FGDMPSSTRI KLGVVSFFEW GCAAAAFIII GYLMGIHLPV 200
YKILPLFCIG CAVGIVSLIP GGLGSFELVL FTGFAAEGLP KETVVAWLLL 250
YRLAYYIIPF FAGIYFFIHY LGSQINQRYE NVPKELVSTV LQTMVSHLMR 300
ILGAFLIFST AFFENITYIM WLQKLGLDPL QEQMLWQFPG LLLGVCFILL 350
ARTIDQKVKN AFPIAIIWIT LTLFYLNLGH ISWRLSFWFI LLLLGLLVIK 400
PTLYKKQFIY SWEERIKDGI IIVSLMGVLF YIAGLLFPIR AHITGGSIER 450
LHYIIAWEPI ALATLILTLV YLCLVKILQG KSCQIGDVFN VDRYKKLLQA 500
YGGSSDSGLA FLNDKRLYWY QKNGEDCVAF QFVIVNNKCL IMGEPAGDDT 550
YIREAIESFI DDADKLDYDL VFYSIGQKLT LLLHEYGFDF MKVGEDALVN 600
LETFTLKGNK YKPFRNALNR VEKDGFYFEV VQSPHSQELL NSLEEISNTW 650
LEGRPEKGFS LGYFNKDYFQ QAPIALVKNA EHEVVAFANI MPNYEKSIIS 700
IDLMRHDKQK IPNGVMDFLF LSLFSYYQEK GYHYFDLGMA PLSGVGRVET 750
SFAKERMAYL VYHFGSHFYS FNGLHKYKKK FTPLWSERYI SCSRSSWLIC 800
AICALLMEDS KIKIVK (SEQ ID NO: 5) 816



FIG. 1e







MRILGAFLIF STAFFENITY IMWLQKLGLD PLQEQMLWQF PGLLLGVCFI 50
LLARTIDQKV KNAFPIAIIW ITLTLFYLNL GHISWRLSFW FILLLLGLLV 100
IKPTLYKKQF IYSWEERIKD GIIIVSLMGV LFYIAGLLFP IRAHITGGSI 150
ERLHYIIAWE PIALATLILT LVYLCLVKIL QGKSCQIGDV FNVDRYKKLL 200
QAYGGSSDSG LAFLNDKRLY WYQKNGEDCV AFQFVIVNNK CLIMGEPAGD 250
DTYIREAIES FIDDADKLDY DLVFYSIGQK LTLLLHEYGF DFMKVGEDAL 300
VNLETFTLKG NKYKPFRNAL NRVEKDGFYF EVVQSPHSQE LLNSLEEISN 350
TWLEGRPEKG FSLGYFNKDY FQQAPIALVK NAEHEVVAFA NIMPNYEKSI 400
ISIDLMRHDK QKIPNGVMDF LFLSLFSYYQ EKGYHYFDLG MAPLSGVGRV 450
ETSFAKERMA YLVYHFGSHF YSFNGLHKYK KKFTPLWSER YISCSRSSWL 500
ICAICALLME DSKIKIVK (SEQ ID NO: 6) 518



FIG. 1f

TAGTTTAGAC CGTATTGGGA TTATTGGTGT CAATGGAACT GGAAAGACAA CACTATTAGA

XbI I G V NGT GKT T LLD 

TGTTATTTCG GGTGAATTAG GTTTTGATGG TGATCGTTCC CCTTTTTCAT CAGCTAATGA 



60 



120 



180 



240 



AATTTTGATA TCGAAACAAC AACTTTTGAG GCAATGAAAA AGCACGCGTC ATTATTGGAG 
> b I .r TFE AMKK HAS LLE 

AAAATATCTG TTGAGCGTTC TTTTATTGAA TTTGATAAAC TTCTATTAGC ACCTTATTGG 
CGTAAAGGAA TGCTGGCACT AATAGATAGT CATGCTTTTA ATTATCTACC ATGCTTaL 

AATAGGGAAT TACAATTAAG CGCCTTTTTG TCCCAGTTAG ATAAAGATTT TTTATTTGAG 

W^i> A F L SQLD KDF LFE 

ACAT C AG AAC AAGCTTGGGC ATCACTCATC TTGAGTATGG AAGTTGAACA CACAAAGACT 300 

o jl U S L I LSME V E H T K T 

TTTTTAAAAA AATGGAAGAC ATCAACTCAC TTTCAAAAAG ATGTTGAGCA TATAGTGGAT 360 

GTTTATCGTA TTCGTGAACA AATGGGATTG GCTAAAGAAC ATCTTTATCG TTATGGaL 

ACTATAA.TAA AACAAGCGGA AGGTATTCGC AAAGCAAGAG GCTTGATGGT TGATTTCGAA 
a J. K QAE GIR KARG L M V DFE 

AAAATAGAAC AACTAGATAG TGAGTTAGCA ATCCATGATA GGCATGAGAT AGTTGTCAAT 
a e. U LDS ELA IHDR H E I VVN 

GGTGGCACCT TAATCAAGAA ATTAGGAATA AAACCTGGTC CACAGATGGG AGATATTATC 

IKK L G I KPGP QMG D I I 

TCTCAAATTG AATTAGCCAT TGTTTTAGGA CAACTGA.TTA ATGAAGAAGA GGCTATTTTA 
CATTTTGTTA AGCAGTACTT GATGGATTAG AGAGGATTAT ATGAGCGATT TTTTAGTAGA 



420 



480 



540 



600 



660 



720 



TGGATTGACT AAGTCGGTTG GTGATAAGAC GGTCTTTAGT AATG^TTCAT TTATCATCCA 780 

rLl VFS NVSF IIH 



840 



900 



960 



1020 



TTATAAGATT GCTTATTTAA AACAAGAACC AGACTTTGAT GATTCTCAGA CAATTTTGGA 

QEP DFD DSQT ILD 

CACCGTACTT TCTTCTGACT TAAGAGAGAT GGCTTTAATT AAAGAATATG AATTATTGCT 

DL ALI KEYE LLL 

TAATCACTAC GAAGAAAGTA AGCAATCArr TOTtr.,-.,, ^ 

N H Y EE S K o SR TCTAGAGAAA GTAATGGCAG AAATGGATTC 1080 

usk LEK V M A E M 'DS 

TTTAGATGCT TGGTCTATTG AGAGCGAAGT CAAAACACTA m Trv t »„ ™~ 

LDA WSTF c v T, *-rtAAACAGTA TTATCCAAAT TAGGTATTAC 114 0 

^ SEV KTV LSKL GIT 

TGATTTGCAG TTGTCGGTTG GTGAATTATC AGGAGGATTA CGAAGACGTG TTCAATTAGC 1200 

tIjS GGL RRRV QLA 

GCAAGTATTA TTAAATGATG CAGATTTATT GCTCTTAGAC GAACCTACTA ACCACTTAGA 1260
QVL LNDA DLL LLD EPTN HLD

TATTGACACT ATTGCATGGT TAACGAATTT TTTGAAAAAT AGTAAAAAGA CAGTGCTTTT 1320 
IDT IAWL TNF LKN SKKT VLF 

TATAACTCAT GATCGTTATT TTCTAGACAA TGTTGCAACA CGTATTTTTG AATTAGATAA 1380 
ITH DRYF LDN VAT RIFE LDK 

GGCACAGATT ACAGAATATC AAGGCAATTA TCAGGATTAT GTCCGACTTC GTGCAGAACA 1440
AQI TEYQ GNY QDY VRLR AEQ

AGACGAGCGT GATGCTGCTA GTTTACATAA AAAGAAACAG CTTTATAAAC AGGAACTAGC 1500
DER DAAS LHK KKQ LYKQ ELA

TTGGATGCGT ACTCAGCCAC AAGCTCGTGC AACGAAACAA CAGGCTCGTA TTAATCGTTT 1560
WMR TQPQ ARA TKQ QARI NRF

TCAAAATCTA AAAAACGATT TACACCAAAC AAGCGATACA AGCGATTTGG AAATGACATT 1620
QNL KNDL HQT SDT SDLE MTF

TGAAACAAGT CGAATTGGGA AAAAGGTTAT TAATTTTGAA AATGTCTCTT TTTCTTACCC 1680
ETS RIGK KVI NFE NVSF SYP

AGATAAATCT ATCTTGAAAG ACTTTAATTT GTTAATTCAA AATAAAGACC GTATTGGCAT 1740
DKS ILKD FNL LIQ NKDR IGI
DKS ILKD FNL LIQ NKDR IGI 

CGTTGGAGAT AATGGTGTTG GAAAGTCAAC CTTACTTAAT TTAATTGTTC AAGATTTACA 1800 
VGD NGVG KST LLN LIVQ DLQ 

GCCGGATTCG GGTAATGTCT CTATTGGTGA AACGATACGT GTAGGTTACT TTTCACAACA 1860
PDS GNVS IGE TIR VGYF SQQ 

ACTTCATAAT ATGGATGGCT CAAAACGTGT TATTAATTAT TTGCAAGAGG TTGCAGATGA 1920 
LHN MDGS KRV INY LQEV ADE 

GGTTAAAACT AGTGTCGGTA CAACAAGTGT GACAGAACTA TTGGAACAAT TTCTCTTTCC 1980 
VKT SVGT TSV TEL LEQF LFP 

ACGTTCGACA CATGGAACAC AAATTGCAAA ATTATCAGGT GGTGAGAAAA AAAGACTTTA 2040
RST HGTQ IAK LSG GEKK RLY

CCTTTTAAAA ATCCTGATTG AAAAGCCTAA TGTGTTACTA CTTGATGAGC CGACAAATGA 2100 
LLK ILIE KPN VLL LDEP TND

CTTAGATATT GCTACATTAA CTGTTCTTGA AAATTTTTTA CAAGGCTTTG GTGGTCCTGT 2160 
LDI ATLT VLE NFL QGFG GPV 

GATTACAGTT AGTCACGATC GTTACTTTTT AGATAAAGTG GCTAATAAAA TTATTGCGTT 2220 
ITV SHDR YFL DKV ANKI IAF 

TGAAGATAAC GATATCCGTG AATTTTTTGG TAATTATACT GATTATTTAG ATGAAAAAGC 2280
EDN DIRE FFG NYT DYLD EKA

ATTTAATGAG CAAAATAATG AAGTTATCAG TAAAAAAGAG AGTACCAAGA CAAGTCGTGA 2340
FNE QNNE VIS KKE STKT SRE

AAAGCAAAGT CGTAAAAGAA TGTCTTACTT TGAAAAACAA GAATGGGCGA CAATTGAAGA 2400
KQS RKRM SYF EKQ EWAT IED

CGATATTATG ATATTGGAAA ATACTATCAC TCGTATAGAA AATGATATGC AAACATGTGG 2460

GATTTAGATA AACCTGATAC AGCATATTCA GACCCTCACT TAGATCATTT GACCTCATAC



MA Y I W 



2520 



DIM ILEN TIT RIE NDMQ TCG 

TAGTGATTTT ACAAGGTTAT CTGATTTACA AAAGGAATTA GATGCAAAAA ATGAAGCACT 
F DLQ KEL DAKN EAL 

TCTAGAAAAG TATGACCGTT ATGAGTACCT TAGTGAGTTA GACACATGAT TATCCGTCCG 2580 

^ 1 u SEL DTMI I R p 

ATTSTTMSA ATG„ «™»TTC «C J™ „ <0 



2700 



TACGAAAAAA TAGAGAAGTC AGGATTCTTT GTCATTGAGG AGAGAGATGA GATTATTGGC 2760 
KI GFF VIEE RDE I I G 

TGTGGCGGCT TTGGTCCGCT GAAAAATCTA ATTGCAGAGA TGCAGAAGGT GTACATTGCA 2820 
^ ^ r ^ P L KNL I A E M QKV Y I A 

GAACGTTTCC GTGGTAAGGG GCTTGCTACT GATTTAGTGA AAATGATTGA AGTAGAAGCT 2880 
^ KtR GKG L A T DLVK M I E V E A 



CGAAAAATTG GGTATAGACA ACTTTATTTA GAGACAGCCA GTACTTTGAG TAGGGCAACT 2940 



GCGGTTTATA AGCATATGGG ATATTGTGCC TTATCGCAAC CAATAGCAAA TGATCAAGGT 3000 
CATACAGCTA TGGATATTTG GATGATTAAA GATTTATAAG TTGAAAGTGG ATTAGTGAAC 
ATGGATTAAT TATTTTGAGA TAAGAGGAAA GAAAAGGAGA CATATATGGC ATATATTTGG 



3060 



3120 



TCTTATTTGA AAAGGTACCC CAATTGGTTA TGGCTTGATT TACTAGGAGC > TATGCTTTTT 3!80 

GTGACGGTTA TCCTAGGAAT GCCCACAGCC TTAGCGGGTA TGATTGATAA TGGCGTTACA 3240 

i- ^ M FTA LAGM IDN GVT 

AAAGGTGATC GGACTGGAGT TTATCTGTGG ACGTTCATCA TGTTTATATT TGTTGTACTA 3300 

GGTATTATTG GGCGTATTAC GATGGCTTAC GCATCTAGTC GCTTAACGAC AACAATGATT 
^ MAY ASSR LTT TMI 

AGAGATATGC GTAA.TGATAT GTATGCTAAG CTTCAAGAAT ACTCCCATCA TGAATATGAA 

Q E Y SHH E.YE 



3360 



3420 



CAGATAGGTG TATCTTCACT AGTGACACGT ATGACAAGCG ATACTTTTGT TTTGATGCAA 3480 

o b L VTR MTSD TFV LMQ 

TTTGCTGAAA TGTCTTTACG TTTAGGCCTA GTAACTCCTA TGGTAATGAT TTTTAGCGTG 3540 

i> ' LR LGL VTPM V M I FSV 

GTTATGATAC TAATTACGAG TCCATCTTTG GCTTGGCTTG TAGCGGTTGC GATGCCTCTT 3600 

A 1 ^ P b L AWLV AVA MPL 

TTGGTAGGAG TCGTTTTATA TGTAGCTATA AAAACAAAAC CTTTATCTGA AAGACAACAG 3660 

AI KTKP LSE RQQ 

ACTATGCTTG ATAAAATCAA TCAATATGTT CGTGAAAATT TAACAGGGTT ACGCGTTGTT 3720
TMLD KIN Q Y V RENL TGL RVV 

AGAGCCTTTG CAAGAGAGAA TTTTCAATCA CAAAAATTTC AAGTCGCTAA CCAACGTTAC 3780 
RAFA REN FQS QKFQ VAN QRY 

ACAGATACTT CAACTGGTCT TTTTAAATTA ACAGGGCTAA CAGAACCACT TTTCGTTCAA 3840 
TDTS TGL FKL TGLT EPL FVQ 

ATTATTATTG CAATGATTGT GGCTATCGTT TGGTTTGCTT TGGATCCCTT ACAAAGAGGT 3900 
I I I A MIV A I V WFAL DPL QRG 

GCTATTAAAA TAGGGGATTT AGTTGCTTTT ATCGAATATA GCTTCCATGC TCTCTTTTCA 3960 
AIKI GDL VAF IEYS FHA LFS

TTTTTGCTAT TTGCCAATCT TTTTACTATG TATCCTCGTA TGGTGGTATC AAGCCATCGT 4020 
FLLF ANL FTM YPRM VVS SHR 

ATTAGAGAGG TGATGGATAT GCCAATCTCT ATCAATCCTA ATGCCGAAGG TGTTACGGAT 4080
IREV MDM PIS INPN AEG VTD 

ACGAAACTTA AAGGGCATTT AGAATTTGAT AATGTAACAT TCGCTTATCC AGGAGAAACA 4140 
TKLK GHL EFD NVTF AYP GET 

GAGAGTCCCG TTTTGCATGA TATTTCTTTT AAAGCTAAGC CTGGAGAAAC AATTGCTTTT 4200 
ESPV LHD ISF KAKP GET IAF 

ATTGGTTCAA CAGGTTCAGG AAAATCTTCT CTTGTTAATT TGATTCCACG TTTTTATGAT 4260 
IGST GSG KSS LVNL IPR FYD 

GTGACACTTG GAAAAATCTT AGTAGATGGA GTTGATGTAA GAGATTATAA CCTTAAATCA 4320
VTLG KIL VDG VDVR DYN LKS

CTTCGCCAAA AGATTGGATT TATCCCCCAA AAAGCTCTTT TATTTACAGG GACAATAGGA 4380
LRQK IGF IPQ KALL FTG TIG

GAGAATTTAA AATATGGAAA AGCTGATGCT ACTATTGATG ATCTTAGACA AGCGGTTGAT 4440
ENLK YGK ADA TIDD LRQ AVD

ATTTCTCAAG CTAAAGAGTT TATTGAGAGT CACCAAGAAG CCTTTGAAAC GCATTTAGCT 4500
ISQA KEF IES HQEA FET HLA

GAAGGTGGGA GCAATCTTTC TGGGGGTCAA AAACAACGGT TATCTATTGC TAGGGCTGTT 4560
EGGS NLS GGQ KQRL SIA RAV

GTTAAAGATC CAGATTTATA TATTTTTGAT GATTCATTTT CTGCTCTCGA TTATAAGACA 4620
VKDP DLY IFD DSFS ALD YKT

GACGCTACTT TAAGAGCGCG TCTAAAAGAA GTAACCGGTG ATTCTACAGT TTTGATAGTT 4680
DATL RAR LKE VTGD STV LIV

GCTCAAAGGG TGGGTACGAT TATGGATGCT GATCAGATTA TTGTCCTTGA TGAAGGCGAA 4740
AQRV GTI MDA DQII VLD EGE

ATTGTCGGTC GTGGTACCCA CGCTCAATTA ATAGAAAATA ATGCTATTTA TCGTGAAATC 4800
IVGR GTH AQL IENN AIY REI

GCTGAGTCAC AACTGAAGAA CCAAAACTTA TCAGAAGGAG AGTGATTGTA TGAGAAAAAA 4860
AESQLKNQNLSEGE MRKK 

ATCTGTTTTT TTGAGATTAT GGTCTTACCT AACTCGCTAC AAAGCTACTC TTTTCTTAGC 4920
GATTTTTTTG AAAGTTTTAT CTAGTTTTAT GAGTGTTCTG GAGCCTTTTA TTTTAGGGTT 4980

A ™ cs ?m "ttt — r t ct ™ a 

ATTGAACGTT CCTTATATTG CTGGTATTTT GATTATTTAT TTTTTCAGAG GTGTTTTCTA 5100 

XJj 1IY FFRG V F y 

TGAATTAGGT TCTTATGGCT CAAATT (SEQ ID NO:7) 
ELGSYGSN 5126 



FIG. 2a 

NFDIETTTFE AMKKHASLLE KISVERSFIE FDKLLLAPYW RKGMLALIDS 50
HAFNYLPCLK NRELQLSAFL SQLDKDFLFE TSEQAWASLI LSMEVEHTKT 100
FLKKWKTSTH FQKDVEHIVD VYRIREQMGL AKEHLYRYGK TIIKQAEGIR 150
KARGLMVDFE KIEQLDSELA IHDRHEIVVN GGTLIKKLGI KPGPQMGDII 200
SQIELAIVLG QLINEEEAIL HFVKQYLMD (SEQ ID NO: 8) 229



FIG. 2b








[illegible] NVSFIIHSLD RIGIIGVNGT GKTTLLDVIS 50
AYLKQEPDFD DSQTILDTVL SSDLREMALI 100
[illegible] VMAEMDSLDA WSIESEVKTV 150
LNDADLLLLD EPTNHLDIDT TAWT.TNFLKN 200
[illegible] [illegible] RIFELDKAQI TEYQGNYQDY VRT.RAPDnFR 250
[illegible] [illegible] TQPQARATKQ QARINRFQNL [illegible] 300
[illegible] NVSFSYPDKS ILKDFNLLIQ [illegible] 350
[illegible] [illegible] GNVSIGETIR VGYFSQQLHN [illegible] 400


LQEVADEVKT SVGTTSVTEL LEQFLFPRST HGTQIAKLSG GEKKRLYLLK 450
ILIEKPNVLL LDEPTNDLDI ATLTVLENFL QGFGGPVITV SHDRYFLDKV 500
ANKIIAFEDN DIREFFGNYT DYLDEKAFNE QNNEVISKKE STKTSREKQS 550
RKRMSYFEKQ EWATIEDDIM ILENTITRIE NDMQTCGSDF TRLSDLQKEL 600
DAKNEALLEK YDRYEYLSEL DT (SEQ ID NO: 9) 622






FIG. 2c 






MIIRPIIKND DQAVAQLIRQ SLRAYDLDKP DTAYSDPHLD HLTSYYEKIE 50
KSGFFVIEER DEIIGCGGFG PLKNLIAEMQ KVYIAERFRG KGLATDLVKM 100
IEVEARKIGY RQLYLETAST LSRATAVYKH MGYCALSQPI ANDQGHTAMD 150
IWMIKDL (SEQ ID NO:10) 157






FIG. 2d 
MAYIWSYLKR YPNWLWLDLL GAMLFVTVIL GMPTALAGMI DNGVTKGDRT 50
GVYLWTFIMF IFVVLGIIGR ITMAYASSRL TTTMIRDMRN DMYAKLQEYS 100
HHEYEQIGVS SLVTRMTSDT FVLMQFAEMS LRLGLVTPMV MIFSVVMILI 150
TSPSLAWLVA VAMPLLVGVV LYVAIKTKPL SERQQTMLDK INQYVRENLT 200
GLRVVRAFAR ENFQSQKFQV ANQRYTDTST GLFKLTGLTE PLFVQIHAM 250
IVAIVWFALD PLQRGAIKIG DLVAFIEYSF HALFSFLLFA NLFTMYPRMV 300
VSSHRIREVM DMPISINPNA EGVTDTKLKG HLEFDNVTFA YPGETESPVL 350
HDISFKAKPG ETIAFIGSTG SGKSSLVNLI PRFYDVTLGK ILVDGVDVRD 400
YNLKSLRQKI GFIPQKALLF TGTIGENLKY GKADATIDDL RQAVDISQAK 450
EFIESHQEAF ETHLAEGGSN LSGGQKQRLS IARAVVKDPD LYIFDDSFSA 500
LDYKTDATLR ARLKEVTGDS TVLIVAQRVG TIMDADQIIV LDEGEIVGRG 550
THAQLIENNA IYREIAESQL KNQNLSEGE (SEQ ID NO: 11) 579



FIG. 2e 



MRKKSVFLRL WSYLTRYKAT LFLAIFLKVL SSFMSVLEPF ILGLAITELT 50 
ANLVDMAKGV SGAELNVPYI AGILIIYFFR GVFYELGSYG SN 92 
(SEQ ID NO:12)



FIG. 2f 
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AATTTGGAAG TGCTCTATCA ACAGTTGAAG TAAAGGAGAT TATTAGTGAA GAAAACATAT 60 
FGS A L S TVEV KEI ISE ENIW 

GGTTATATCG GCTCAGTTGC TGCCATTTTA CTAGCTACTC ATATTGGAAG TTACCAACTT 120 
LYR LSC CHFT SYS YWK LPTW 

GGTAAGCATC ATATGGGTCT AGCAACAAAG GACAATCAGA TTGCCTATAT TGATGACAGC 180 

MGL ATK DNQI AYI DDS 
|— -> 

AAAGGTAAGG CAAAAGCCCC TAAAACAAAC AAAACGATGG ATCAAATCAG TGCTGAAGAA 240
KGKA KAP KTN KTMD QIS AEE

GGCATCTCTG CTGAACAGAT CGTAGTCAAA ATTACTGACC AAGGCTATGT GACCTCACAC 300
GISA EQI VVK ITDQ GYV TSH

GGTGACCATT ATCATTTTTA CAATGGGAAA GTTCCTTATG ATGCGATTAT TAGTGAAGAG 360
GDHY HFY NGK VPYD AII SEE

TTGTTGATGA CGGATCCTAA TTACCGTTTT AAACAATCAG ACGTTATCAA TGAAATCTTA 420
LLMT DPN YRF KQSD V I N EIL 

I — > 

GACGGTTACG TTATTAAAGT CAATGGCAAC TATTATGTTT ACCTCAAGCC AGGTAGTAAG 480
DGYV IKV NGN YYVY LKP GSK

CGCAAAAACA TTCGAACCAA ACAACAAATT GCTGAGCAAG TAGCCAAAGG AACTAAAGAA 540
RKNI RTK QQI AEQV AKG TKE 

GCTAAAGAAA AAGGTTTAGC TCAAGTGGCC CATCTCAGTA AAGAAGAAGT TGCGGCAGTC 600 
AKEK GLA Q V A HLSK E E V AAV 

AATGAAGCAA AAAGACAAGG ACGCTATACT ACAGACGATG GCTATATTTT TAGTCCGACA 660
NEAK RQG RYT TDDG YIF SPT 

GATATCATTG ATGATTTAGG AGATGCTTAT TTAGTACCTC ATGGTAATCA CTATCATTAT 720 
D I I D DLG DAY LVPH GNH YHY 

ATTCCTAAAA AGGATTTGTC TCCAAGTGAG CTAGCTGCTG CACAAGCCTA CTGGAGTCAA 780 
IPKK DLS PSE L A A A QAY WSQ 

AAACAAGGTC GAGGTGCTAG ACCGTCTGAT TACCGCCCGA CACCAGCCCC AGGTCGTAGG 840
KQGR GAR PSD YRPT PAP GRR 

AAAGCCCCAA TTCCTGATGT GACGCCTAAC CCTGGACAAG GTCATCAGCC AGATAACGGT 900 
KAPI PDV TPN PGQG HQP DNG 

GGCTATCATC CAGCGCCTCC TAGGCCAAAT GATGCGTCAC AAAACAAACA CCAAAGAGAT 960
GYHP APP RPN DASQ NKH QRD 

GAGTTTAAAG GAAAAACCTT TAAGGAACTT TTAGATCAAC TACACCGTCT TGATTTGAAA 1020 
EFKG KTF KEL LDQL HRL DLK 

TACCGTCATG TGGAAGAAGA TGGGTTGATT TTTGAACCGA CTCAAGTGAT CAAATCAAAC 1080
YRHV EED GLI FEPT QVI KSN

GCTTTTGGGT ATGTGGTGCC TCATGGAGAT CATTATCATA TTATCCCAAG AAGTCAGTTA 1140
AFGY VVP HGD HYHI IPR SQL 

TCACCTCTTG AAATGGAATT AGCAGATCGA TACTTAGCTG GCCAAACTGA GGACAATGAC 1200 
SPLE MEL ADR YLAG QTE DND 

TCAGGTTCAG AGCACTCAAA ACCATCAGAT AAAGAAGTGA CACATACCTT TCTTGGTCAT 1260 
SGSE HSK PSD KEVT HTF LGH

CGCATCAAAG CTTACGGAAA AGGCTTAGAT GGTAAACCAT ATGATACGAG TGATGCTTAT 1320 

GTTTTTAGTA AAGAATGC f TGATTCAGTG GATAAATCAG GAG™ rlLL U1 . 

n s> v DKSG VTA K H G 

GATCATTTCC ACTATATAGG ATTTGGAGAA CTTGAACAAT ATGAGTTGGA TGAGGTCGCT 

\^ 

AACTGGGTGA AAGCAAAAGG TCAAGCTGAT rAfPTTrrTr 

N W V K A K G Q A n ^ AG ? TT ^ CT ^ CTGCTTTGGA TCAGGAACAA 1500 

v ^ u bLAA ALD QEQ 

GGCAAAGAAA AACCACTCTT TG AC ACT AAA AAAfTrarTr rv* 

G K E K P L F DTK c * GC AAAGTAAC AAAAGATGGT 1560 

iK KVSR K V T KDG 

AAAGTOGGCT ATATGATGCC AAAAGATGGT AAGGACTATT TCTATGCTCG TGATCAACTT 1620 
GATTTGACTC AGATTGCCTT TGCCGAACAA GAACTAATGC TTAAAGATAA GAAGCATTAC 1680 

CGTTATGACA TTGTTGACA.C AGGTATTGAG CCACGACTTG CTGTAGATGT GTCAAGTCTG 1740 

^ ^KLA VDV SSL 

CCGATGCATG CTGGTAATGC TACTTACGAT ACTGGAAGTT CGTTTGTTAT CCCACATATT 1800 

U TGSS F V I PHI 
GATCATATCC ATGTCGTTCC GTATTCATGG TTGACGCGCG ATCAGA.TTGC AACAGTCAAG i860 

TATGTGATGC AACACCCCGA AGTTCGTCCG GATGTATGGT CTAAGCCAGG GCATGAAGAG 1920 

TCAGGTTCGG TCATTCCAAA TGTTACGCCT CTTGATAAAC GTGCTGGTAT GCCAAACTGG 1980 

1 * N VTP LDKR AGM PNW 

CAAATTATCC ATTCTGCTGA AGAAGTTCAA AAAGCCCTAG CAGAAGGTCG TTTTGCAACA 2040 

^> a t t v Q KALA EGR FAT 

CCAGACGGCT ATATTTTCGA TCCACGAGAT GTTTTGGCCA AAGAAACTTT TGTATGGAAA 2100 

r R D VLAK ETF VWK 



GATGGCTCCT TTAGCATCCC AAGAGCAGAT GGCAGTTCAT TGAGAACCAT TAATAAATCT 2160 



I N K S 



GATCTATCCC AAGCTGAGTG GCAACAAGCT CAAGAGTTAT TGGCAAAGAA AAATACTGGT 2220 

U U A QELL A K K NTG 
GATGCTACTG ATACGGATAA ACCCAAAGAA AAGCAACAGG CAGATAAGAG CAATGAAAAC 2280 

CAACAGCCAA GTGAAGCCAG TAAAGAAGAA AAAGIUTCAG ATGACTTTAT AGACAGTTTA 2340 

CCAGACTATG GTCTAGATAG AGCAACCCTA GAAGATCATA TCAATCAATT AGCACAAAAA 2400 

IL E DHI NQL AQK 

GCTAATATCG ATCCTAAGTA TCTCATTTTC CAACCAGAAG GTGTCCAATT TTATAATAAA 2460 

QPEG VQ F Y N K 
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AATGGTGAAT TGGTAACTTA TGATATCAAG ACACTTCAAC AAATAAACCC TTAACCAAAA 2520 
NGEL VTY DIK TLQQ I N P 

GAAGATCTCA TTGTTAAAGC ACTGCTTTGT CAAAGCAAGT TACGGTGATT TTGAAGTCAT 2580 

TCTATGTAAC GAGTAGTGAT AAAAGTTGGA TAATAGCGGT TTTCTTTTGC AAAGAAATGG 2640

TATCCATGTT AGAATAGTAA AAAAAGAGGA GGATTCTTGG ACTAATGTCA AATAAGTAGA 2700

CAGAAAACTG TGTTATTTTA TTGCGTTAAA ATAATTTTCT TCTTTCTGAT TAGGGGTTAG 2760

. K I A N F YNE EKQ NPTL 

TCCTAGATTA GCCGTATGTG GGTTGTAATT GTTATAAAAA TTCTCAATGT ATTCAAAGCA 2820 
GLN ATH PNYN NYF N E I YEFC 

GTCTAATTGA ACCTGTTTGA TATTTTGATA ATGTTTTCGG TTGATTTGTC TATGCTTTAA 2880 
DL'Q VQK INQY HKR N I Q RHKL 

ATACTTGAAA AATGCTTCAG TTACGGCATT ATCATAAGGA TATCCAGGAT TAGAAAAAGA 2940
YKF FAE TVAN DYP YGP NSFS

ATGCATGATA TTGGCACTGC ACCCTAATAG TGAGACGCAA GAAAAACACT TTTAGGCAAT 3000
H M A I 

< — | 

CAGTTTTCTG TACTGTACAG GCGACTGGTC GTTTAATCTC TGTTGAATTC TAGTTTCATT 3060 
LKR YQV PSQD NLR Q Q I RTEN 

ATAAAATGTA ATGTAATTTT TAACAATATT TGTTATACTA TCTTTGTTGT ATTTTCTCCT 3120
YFT IYN KVIN TIS DKN YKRR 

ATTATGGAAA TAAAAGGTTT CAGTCTTTAG GACGGTGTGA AACCATTCAA TACAGGCATT 3180 
NHF YFT ETKL VTH FWE ICAN 

ATCTGCAGGT GTTCCTTTTC GAGACATTGA GCGGATAATG TCTTTTTCCG TGCAAGCCTG 3240
DAP TGK RSMS RII DKE TCAQ 

GTAGTAAGCC ATAGAAGTAT ACACTGAGCC TTGGTCACTG TGTAAGATTG CTCCTTTATT 3300 
Y Y A M 
< I 

TAGGCAATTT TAACTGATTA AGGGTGTCTA GTACAAAATC CGTGTCCTGA CAATCTGAGA 3360
KPLK LQN LTD LVFD TDQ CDS

TAGTGTAAGC TATAATTTCT CGGTTATAGA GATTCATAAT TGATGAGAGA TACAATTTAC 3420
ITYA IIE RNY LNMI SSL YLK

AGTTACCGAA ATATAGGTAG GTAATATCTG TTACGAGCTT TTCCTTAGGC TTATCGGCAT 3480
CNGF YLY TID TVLK EKP KDA

GGAAATCCCG ACTCAATTTA TTATCTGTTA AATAATAAGC TTTACCCAAA TTGGGAACTT 3540
HGDR SLK NDT LYYA KGL NPV 

TCTTGGTACG TGTCCGACAA AGCCAGCCAT TATTTTTCAT GATACGATAG ACTTTCTTTG 3600 
KKTR TRC LWG NNKM IRY VKK 

TATTAACAGT CAATCCGTGG ATTTTTTTGA GCAATCGTGT AATGGTACGA TAGCCATAAA 3660 
TNVT LGH IKK LLRT ITR YGY 

TAAAGTGATT CTCCATACAG AGCTGTTCAA TTAATTCAAT AAGGTCATCT TTTTTTGCGG 3720 
I F H N EM 
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CTTCTCATAC TCCTTTTTCC AACGGTAATA GGTCGACCGC TTGACCTTAA AACAGTCTAG 
AATGAAAACT ATCGGGTAGT TGTTTTTATA GTCTTCCACA AGCTTGATAA GACTTACTTT 

ATCGATTTCC TTATCAAGCC TCGATACTTT TTTunrir^ „«. • K 

I S K R I L G r y 3 TTTAAGAGGT CAACCTGTAA TTGTAATTGT 

KLL DVQL Q LQ 

Tf? 0 srs^ -m* ? t t t f tt% c 3960 

TGAAAACGAT « CT C ™ If ^ ° 



3780 
3840 
3900 



4020 



4080 



TTTTjawc CTAAAGJCTC T JMC, CTGTTAGACT TGCCTGCTTT CTTCA-TATCG 

ATGCAAGCCA GCTTAGTTTC CCATGAATAT GCTTTTTTAA CCATAAtJ AC^TC^T 4!« 

x A K K V M 

TTCTAGTTTA CTAAATTTCA ACAGGAGTGT TTTTCTTTTG^CtIaTTTTA GGGATTCAGT 
GCCTATTGTT GTCATCAATT ATTTTTCTAA ATTCCCCGGA CTTAAATTGT GACCCTTGGT 
CGGAATGAAA GAGAAGTGTT CCTTCAATCT TTCTTTTATT AAGTGAAAAG GCAACACTTT 



4200 
4260 
4320 
4380 



4440 



4500 



TCTGTACAAC ATTTATAAAG TGTTTTTCTA rrr^Ti.,,. „ 

i^iiITTCTA GGCAATTAAT CTTTTAGTCA TTGGTGTTTG 

•AIL RKT MPTQ 
GTAGTTGAGA CTACCATGAA TGCGGTGGTA *TTCCACCAA TCAACATAGT CTTTAGTCTT 

AAGAGCTAGT TCTTCCAGCA AT TGAAAGGX TT c„ LLL TTTTGAAAGCT 
AG„A C^CAGCTA CGGCATTGTC ATAAGGAXAA CCAGCCrlc TAAGCgLg <S 6 0 

TGTGATTCCA AAGGCTTCCA ATATTTCATC AATTAACTGA TTATCAAACT 'JnLZ 

ILQ NDF EKGR 
ATCTGAATGG AAGATCTTGA CTTTGGTCAG GGCGTAAGGG ATGCTTTGTA TGGCTTGCTT « 80 

AACGAGTTCA GCGGTCTTGT GC CAACCAAG AGACAGGCCG ATGA^TCAC GGTTGTATAG 
GTCAATGATG AGGCAAACAT AAGCCCAAGG ATJGCGTACA CGAACATAGG TWGtLt 4,00 



4620 



4740 



GACTAAGGCT TGTAG.GGTC TTTCTTGCTT AAATTGCCTG TCTAAG.GGT TGGGAATAGG 4860 

ggcttcattc ttg™ aatgtggttt gaaggtggct ILL IgLccaI , 920 

* t a kqy vsvl 

attgagtcgc ttcataatgc gtcgaatccg acgacgtgaa agtgtgatac cttcgttat-t „„„ 

CAAGCATATT TTGArTTTTC TGGATCCGTA TCTAGACTCG CTATCGAGAA UJJ 5040 

1 RSE SDL FIRK 
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AATAGTTTCT TCAAACTCCG TTTCAGATAC TGACTCCACG GCTTGATAGT AATAACTTGA 5100 
I T E EFE TESV SEV AQY YYSS 

GTGTGGCATA TTCAGCCAGC GACACATCTT TGAAATGCTG TATTTATCCT TATTAGCAGT 5160
HPM NLW RCMK SIS YKD KNAT 

GATTATTTCC CTTTTTGTGC CATAATCACC GCTGCTTGCT TTAGGATATC TAATT 5215 
IIE RKT GYDG SSA KPY R I 

(SEQ ID NO:13) <--|

FIG. 3a 



FGSALSTVEV KEIISEENIW LYRLSCCHFT SYSYWKLPTW 40

(SEQ ID NO:14)

FIG. 3b 



MGLATKDNQI AYIDDSKGKA KAPKTNKTMD QISAEEGISA EQIVVKITDQ 50
GYVTSHGDHY HFYNGKVPYD AIISEELLMT DPNYRFKQSD VINEILDGYV 100
IKVNGNYYVY LKPGSKRKNI RTKQQIAEQV AKGTKEAKEK GLAQVAHLSK 150
EEVAAVNEAK RQGRYTTDDG YIFSPTDIID DLGDAYLVPH GNHYHYIPKK 200
DLSPSELAAA QAYWSQKQGR GARPSDYRPT PAPGRRKAPI PDVTPNPGQG 250
HQPDNGGYHP APPRPNDASQ NKHQRDEFKG KTFKELLDQL HRLDLKYRHV 300
EEDGLIFEPT QVIKSNAFGY VVPHGDHYHI IPRSQLSPLE MELADRYLAG 350
QTEDNDSGSE HSKPSDKEVT HTFLGHRIKA YGKGLDGKPY DTSDAYVFSK 400
ESIHSVDKSG VTAKHGDHFH YIGFGELEQY ELDEVANWVK AKGQADELAA 450
ALDQEQGKEK PLFDTKKVSR KVTKDGKVGY MMPKDGKDYF YARDQLDLTQ 500
IAFAEQELML KDKKHYRYDI VDTGIEPRLA VDVSSLPMHA GNATYDTGSS 550
FVIPHIDHIH VVPYSWLTRD QIATVKYVMQ HPEVRPDVWS KPGHEESGSV 600
IPNVTPLDKR AGMPNWQIIH SAEEVQKALA EGRFATPDGY IFDPRDVLAK 650
ETFVWKDGSF SIPRADGSSL RTINKSDLSQ AEWQQAQELL AKKNTGDATD 700
TDKPKEKQQA DKSNENQQPS EASKEEKESD DFIDSLPDYG LDRATLEDHI 750
NQLAQKANID PKYLIFQPEG VQFYNKNGEL VTYDIKTLQQ INP (SEQ ID NO: 15) 793











FIG. 3c 
MTDPNYRFKQ SDVINEILDG YVIKVNGNYY VYLKPGSKRK NIRTKQQIAE 50
QVAKGTKEAK EKGLAQVAHL SKEEVAAVNE AKRQGRYTTD DGYIFSPTDI 100
IDDLGDAYLV PHGNHYHYIP KKDLSPSELA AAQAYWSQKQ GRGARPSDYR 150
PTPAPGRRKA PIPDVTPNPG QGHQPDNGGY HPAPPRPNDA SQNKHQRDEF 200
KGKTFKELLD QLHRLDLKYR HVEEDGLIFE PTQVIKSNAF GYVVPHGDHY 250
HIIPRSQLSP LEMELADRYL AGQTEDNDSG SEHSKPSDKE VTHTFLGHRI 300
KAYGKGLDGK PYDTSDAYVF SKESIHSVDK SGVTAKHGDH FHYIGFGELE 350
QYELDEVANW VKAKGQADEL AAALDQEQGK EKPLFDTKKV SRKVTKDGKV 400
GYMMPKDGKD YFYARDQLDL TQIAFAEQEL MLKDKKHYRY DIVDTGIEPR 450
LAVDVSSLPM HAGNATYDTG SSFVIPHIDH IHVVPYSWLT RDQIATVKYV 500
MQHPEVRPDV WSKPGHEESG SVIPNVTPLD KRAGMPNWQI IHSAEEVQKA 550
LAEGRFATPD GYIFDPRDVL AKETFVWKDG SFSIPRADGS SLRTINKSDL 600
SQAEWQQAQE LLAKKNTGDA TDTDKPKEKQ QADKSNENQQ PSEASKEEKE 650
SDDFIDSLPD YGLDRATLED HINQLAQKAN IDPKYLIFQP EGVQFYNKNG 700
ELVTYDIKTL QQINP (SEQ ID NO: 16) 715



FIG. 3d 



MHSFSNPGYP YDNAVTEAFF KYLKHRQINR KHYQNIKQVQ LDCFEYIENF 50
YNNYNPHTAN LGLTPNQKEE NYFNAIK (SEQ ID NO: 17) 77



FIG. 3e 



MAYYQACTEK DIIRSMSRKG TPADNACIEW FHTVLKTETF YFHNRRKYNK 50 
DSITNIVKNY ITFYNETRIQ QRLNDQSPVQ YRKLIA (SEQ ID NO: 18) 86



FIG. 3f 



MENHFIYGYR TITRLLKKIH GLTVNTKKVY RIMKNNGWLC RTRTKKVPNL 50 
GKAYYLTDNK LSRDFHADKP KEKLVTDITY LYFGNCKLYL SSIMNLYNRE 100
IIAYTISDCQ DTDFVLDTLN QLKLPK (SEQ ID NO: 19) 126



FIG. 3g 
MVKKAYSWET KLACIDMKKA GKSNRVTMET LGIKNNSQIY TWMKWYENEE 50
LYRFHQGVGK QYTYGKGLEH LSEVEQLQLQ VDLLKKYRGL IRKSIK 96
(SEQ ID NO:20)

FIG. 3h



IRYPKASSGD YGTKREIITA NKDKYSISKM CRWLNMPHSS YYYQAVESVS 50 

ETEFEETIKR IFLDSESRYG SRKIKICLNN EGITLSRRRI RRIMKRLNLV 100 

SVYQKATFKP HSRGKNEAPI PNHLDRQFKQ ERPLQALVTD LTYVRVGNRW 150 

AYVCLIIDLY NREIIGLSLG WHKTAELVKQ AIQSIPYALT KVKMFHSDRG 200 

KEFDNQLIDE ILEAFGITRS LSQAGYPYDN AVAESTYRAF KIEFVYQETF 250 

QLLEELALKT KDYVHWWNYH RIHGSLNYQT PMTKRLIA (SEQ ID NO:21) 288

FIG. 3i 



AATTTGAAAG CAGAATTATC TGTAGAAGAT GAGCAATATA CAGCAACAGT TTATGGTAAA 60
NLKA ELS VED EQYT ATV YGK

TCTGCTCATG GTTCAACACC ACAAGAAGGT GTTAATGGGG CGACTTATTT AGCTCTTTAT 120
SAHG STP QEG VNGA TYL ALY

CTAAGTCAAT TTGATTTTGA AGGTCCTGCT CGTGCTTTCT TAGATGTTAC AGCCAACATT 180
LSQF DFE GPA RAFL DVT ANI

ATTCACGAAG ACTTCTCAGG TGAAAAACTT GGAGTAGCTT ATGAAGATGA CTGTATGGGA 240
IHED FSG EKL GVAY EDD CMG

CCATTGAGCA TGAATGCAGG TGTCTTCCAG TTTGATGAAA CTAATGATGA TAATACTATC 300
PLSM NAG VFQ FDET NDD NTI

GCTCTTAATT TCCGTTACCC ACAAGGGACA GATGCTAAAA CTATCCAAAC TAAGCTTGAG 360
ALNF RYP QGT DAKT IQT KLE

AAACTTAACG GAGTTGAAAA AGTGACTCTT TCTGACCATG AACACACACC ACACTATGTA 420
KLNG VEK VTL SDHE HTP HYV

CCTATGGACG ATGAATTAGT ATCAACCTTA CTAGCTGTCT ATGAAAAGCA AACTGGTCTT 480
PMDD ELV STL LAVY EKQ TGL

AAAGGACATG AACAGGTTAT TGGTGGTGGG ACATTTGGTC GCTTACTTGA ACGGGGTGTT 540
KGHE QVI GGG TFGR LLE RGV

GCATACGGTG CCATGTTCCC AGGAGATGAA AACACTATGC ATCAAGCTAA TGAGTACATG 600
AYGA MFP GDE NTMH QAN EYM

CCTTTAGAAA ATATTTTCCG TTCGGCTGCT ATCTACGCAG AAGCTATCTA TGAATTAATC 660
PLEN IFR SAA IYAE AIY ELI



AAATAAAATA ATCCTTAAAC TAAATATGTG ATCAATGATA AAGGGTGGTG AAGACATGAA 720
AGTGTCTTTG CCTCTTTTCA TAAGGTTAGA TTTGGAGACT TTATGACTGA CTTGGAAAAA 780 

ATTATTAAAG CAATAAAAAG ^CACAG AA T GAAAA T , ^jj^ ^ 

v lN U n Y TEN G I D 
CCTTTG^G CTGCTCCTAA AACAGCTAGG f CAAT fT G ^GCCAAGC ACCTGGXTTA ,00 

AAAACTCAAG AAGCAAGACT ClArjGGAAA GATAAATCTG GA« T ACGCCAGTGG 960 
CTTrarc ATGAAGAGAC ATTTTACCAT TCTGGAAAAT TTGCTGTTTT LLL 7 l020 
TTTTATTACC CAGGCAAAGG AAAATCAGGA GATTTarrrr r-r „ 

F Y Y P G K G *^ CA G GA GATTTACCCC CTAGAAAAGG TTTTGCGGAG 1080 

r KKG F A E 
AAATGGCACC CTCTTATTTT AAAAGAAATG CCTAATGTTr aiT-rr»n™. 

KWHP LIL KEM P N 5 Q ^J 76 **™ GCTAGTTGGT 114 0 
CAGTATGCTC AGAAATATTA TCTTGGAAGC Trrrrnr-B-™ 

QYAQ K y Y L G S I ? u £ A^^TAAC AGAAACAGTT 1200 

° ^ n K NLT ETV 

AAAGCTTACA AAGACTATCT ACCCGATTAT TTnrrrrTr. ^ 

K A Y K D Y L ^ CC £ AT J AT TTACCCCTGG TTCACCCATC ACCGCGAAAT 1260 

^ p L V HPS PRN 

CAAATTTGGC TAAAGAAGAA TCCATGGTTT r^^,.,.. 

Q I W L K K N p w l GAAAAAGATC TAATCGTTGA TTTACAAAAG 1320 

k EKDL IVD LQK 

AT AG T AGC AG ATATTTTAAA AGATTAAGGA TAGGAGTTGG TATGAGAGAT AATCATCTAC 1380 

M R D N H L H 



ACACGTATTT TTCCTATGA, TGTCAAACGG CATTTGAGGA C^AAT GGTTTTACAG 1<4 „ 
GTGAATTTAT CACGACAGAA CATTTTGATT TATCAAATCC LLL LLIJ 150 0 



D V 



TTCGTGATTA ,„ „ »«»M« TAGATTATCT TAATCAGAAA TATGGAAATC l56 0 

1 D Y L NQK YGNR 
GATTTAAAAA AGGAATTGAA ATCGGTTATT TTAAAGATAG GGAATCAGAT ATTTTAGATT 1620 

ATTTAAAAAA TAAAGAATTT GATTTAAAAC TATTGTCAAT c«,LL LaGG^ 168 0 

^ LSI HHN GRYD 
ATTATCTGCA AGAAGAAGCT CTGAAAGTAC CAACAAAGGG AGCTTTTAGC AGATTACTTT 1 740 

AATCGTATGG AATTTGCCAT AGGCCGTGTG GAAGCGCACG TTTTAGCTCA CTTTGATTAT 1 800 
GGTTTTCGTA AGTTAAACTT AGATGTAGAA GATTTAAAAC CGTTTGAAAC GCAATTGAAG I860 

CGCATTTTCA TAAAGATGTT ATCTAAGGGG TTAGCTTTTG AftrTiaiiTnr , 

liAbCTTTTG AACTAAATAC CAAATCCCTT 1920 
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TATCTATATG GGAATGAAAA ACTTTATCGC TATGCTTTAG AGATACTCAA ACAGCTTGGT 1980 

TGTAAACAAT ACTCTATAGG CTCTGACGGT CATATTCCTG AACATTTTTG TTATGAATTT 2040

GATAGACTTC AAGGTCTGCT AAAGGACTAT CAAATTGATG AAAATCATTT GATATGAGGA 2100 

AATTTTTGAT AAAAAAGCTA GGCAATATTG CTTAGCTTTT TTGTAATGCT ATTGATAGTT 2160 

TTAGTGAAAA TTTCAAAAAA ATAAAGAAAT CATTTACTTG TTGCAAGCGC TTGCGTAAAT 2220 

TGTTATGATT TTATTGGTAA CAATTCATTA AAAAAGGAGA ATGATATGAA AAGAAAAGAC 2280 

M K R K D 
| > 

TTATTTGGTG ATAAACAAAC TCAATACACG ATTAGAAAGT TAAGTGTTGG AGTAGCTTCA 2340
LFGD KQT QYT IRKL SVG VAS

GTTACAACAG GGGTATGTAT TTTTCTTCAT AGTCCACAGG TATTTGCTGA AGAAGTAAGT 2400
VTTG VCI FLH SPQV FAE EVS

GTTTCTCCTG CAACTACAGC GATTGCAGAG TCGAATATTA ATCAGGTTGA CAACCAACAA 2460
VSPA TTA IAE SNIN QVD NQQ

TCTACTAATT TAAAAGATGA CATAAACTCA AACTCTGAGA CGGTTGTGAC ACCCTCAGAT 2520
STNL KDD INS NSET VVT PSD

ATGCCGGATA CCAAGCAATT AGTATCAGAT GAAACTGACA CTCAAAAGGG AGTGACAGAG 2580
MPDT KQL VSD ETDT QKG VTE

CCGGATAAGG CGACAAGCCT GCTTGAAGAA AATAAAGGTC CTGTTTCAGA TAAAAATACC 2640
PDKA TSL LEE NKGP VSD KNT

TTAGATTTAA AAGTAGCACC ATCTACATTG CAAAATACTC CCGACAAAAC TTCTCAAGCT 2700
LDLK VAP STL QNTP DKT SQA

ATAGGTGCTC CAAGCCCTAC CTTGAAAGTA GCTAATCAAG CTCCACGGAT TGAAAATGGT 2760
IGAP SPT LKV ANQA PRI ENG

TACTTTAGGC TACATCTTAA AGAATTGCCT CAAGGTCATC CTGTAGAAAG CACTGGACTT 2820
YFRL HLK ELP QGHP VES TGL

TGGATATGGG GAGATGTTGA TCAACCGTCT AGTAATTGGC CAAATGGTGC TATCCCTATG 2880
WIWG DVD QPS SNWP NGA IPM

ACTGATGCTA AGAAAGATGA TTACGGTTAT TATGTTGATT TTAAATTATC TGAAAAACAA 2940
TDAK KDD YGY YVDF KLS EKQ

CGAAAACAAA TATCTTTTTT AATTAATAAC AAAGCAGGGA CAAATTTAAG CGGCGATCAT 3000
RKQI SFL INN KAGT NLS GDH

CATATTCCAT TATTACGACC TGAGATGAAC CAAGTTTGGA TTGATGAAAA GTACGGTATA 3060
HIPL LRP EMN QVWI DEK YGI

CATACTTATC AACCCCTCAA AGAAGGGTAT GTCCGTATTA ACTATTTGAG TTCCTCTAGT 3120
HTYQ PLK EGY VRIN YLS SSS

AACTATGACC ACTTATCAGC ATGGCTCTTT AAAGATGTTG CAACCCCYTC AACAACTTGG 3180
NYDH LSA WLF KDVA TPS TTW

CCAGATGGTA GTAATTTTGT GAATCAAGGA CTATATGGAA GGTATATTGA TGTATCACTA 3240
PDGS NFV NQG LYGR YID VSL 
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AAAACTAACG CCAAAGAGAT TGGTTTTCTA ATCTTAGATG AAAGTAAGAC AGGAGATGCA 3300 

T G D 



GTGAAAGTTC AACCCAACGA CTATGTTTTT AGAGATTTAG CTAACCATAA CCAAATTTTT 3360 

GTAAAAGATA AGGATCCAAA GCTTTATAAT AATCCTTATT ACATTGATCA AGTGCAGCTA 3420

AAGGATGCCC AACAAATTGA TTTAACAAGT ATTCAAGCAA GTTTTACAAC TCTAGATGGG 3480 

^ J-'io -L Q A S FTT ~ 



L D G 



GTAGATAAAA CTGAAATTTT AAAAGAATTG AAAGTGACTG ATAAAAATCA AAATGCTATA 3540

CAAATTTCTG ATATCACTCT CGATACTAGT AAATCTCTTT TAATAATCAA AGGCGACTTT 3600 

AATCCTAAAC AAGGTCATTT CAACATATCT TATAATGGTA ACAATGTCAT GACAAGGCAA 3660 
* «nr w l b YNGN NVM TRQ 

TCTTGGGAAT TTAAAGACCA ACTTTATGCT TATAGTGGAA ATTTAGGTGC AGTTCTCAAT 3720 

v iSGN LGA V L N 

CAAGATGGTT CAAAAGTTGA AGCCAGCCTC TGGTCACCGA GTGCTGATAG TGTCACTATG 3780
ATTATTTATG ACAAAGATAA CCAAAACAGG GTTGTAGCGA CTACCCCCCT T JL^ T 3840 

AATAAAGGTG TTTGGCAGAC GATACTTGAT ACTAAATTAG GTATTAAAAA CTATACTGGT 3 900 

v lijD TKLG I K N YTG 

TACTATTATC TTTACGAAAT AAAAAGAGGT AAGGATAAGG TTAAGATTTT AGATCCTTAT 3960 

i t i i\ R G KDKV K I L DPY 

GCAAAGTCAT TAGCAGAGTG GGATAGTAAT ACTGTTAATG ATGATATTAA AACGGCTAAA 4020

GCAGCTTTTG TAAATCCAAG TCAACTTGGA CCTCAAAATT TAAGTTTTGC TAAAATTGCT 4080 

^ PQNL S F A K I A 

AATTTTAAAG GAAGACAAGA TGCTGTTATA TACGAAGCAC ATGTAAGAGA CTTCACTTCT 4140 

^ u avi YEAH VRD FTS 

GATCGATCTT TGGATGGAAA ATTAAAAAAT CAATTTGGTA CCTTTGCAGC CTTTTCAGAG 4200 

^ j\ n UFGT F A A FSE 

AAACTAGATT ATTTACAGAA ATTAGGAGTT ACACACATTC AGCTTTTACC GGTATTGAGT 4260
KLDY LQK LGV THIQ LLP VLS

TATTTTTATG TTAATGAAAT GGATAAGTCA CGCTCAACAG CTTACACTTC CTCAGACAAT 4320

AATTACAATT GGGGCTATGA CCCACAGAGC TATTTTGCTC TTTCTGGGAT GTATTCAGAG 4380
NYNW GYD PQS YFAL SGM YSE

AAACCAAAAG ATCCATCAGC ACGTATCGCC GAATTAAAAC AATTAATACA TGATATTCAT 4440
KPKD PSA RIA ELKQ LIH DIH
AAACGTGGCA TGGGGGTTAT ACTTGATGTC GTCTATAATC ACACTGCAAA AACTTATCTC 4500
KRGM GVI LDV VYNH TAK TYL

TTTGAGGATA TAGAACCTAA TTATTATCAC TTTATGAATG AAGATGGTTC ACCAAGAGAA 4560
FEDI EPN YYH FMNE DGS PRE

AGTTTTGGAG GGGGACGTTT AGGAACCACT CATGCAATGA GTCGTCGTGT TTTGGTTGAT 4620
SFGG GRL GTT HAMS RRV LVD

TCCATTAAAT ATCTTACAAG TGAATTTAAA GTTGATGGTT TCCGTTTTGA TATGATGGGA 4680
SIKY LTS EFK VDGF RFD MMG

GATCATGATG CGGCTGCGAT TGAATTAGCT TATAAAGAAG CTAAAGCTAT TAATCCTAAT 4740
DHDA AAI ELA YKEA KAI NPN

ATGATTATGA TTGGTGAGGG CTGGAGAACA TTCCAAGGCG ATCAAGGTCA GCCGGTTAAA 4800
MIMI GEG WRT FQGD QGQ PVK

CCAGCTGACC AAGATTGGAT GAAGTCAACC GATACAGTTG GCGTCTTTTC AGATGATATT 4860
PADQ DWM KST DTVG VFS DDI

CGTAATAGCT TGAAATCTGG TTTTCCAAAT GAAGGTACTC CAGCTTTCAT CACAGGTGGC 4920
RNSL KSG FPN EGTP AFI TGG

CCACAATCTT TACAAGGTAT TTTTAAAAAT ATCAAAGCAC AACCTGGGAA TTTTGAAGCA 4980
PQSL QGI FKN IKAQ PGN FEA

GATTCGCCAG GAGATGTGGT GCAGTATATT GCTGCACATG ATAACCTTAC CTTGCATGAT 5040
DSPG DVV QYI AAHD NLT LHD

GTGATTGCAA AATCAATT (SEQ ID NO:22) 5058
VIAK SI



FIG. 4a 



NLKAELSVED EQYTATVYGK SAHGSTPQEG VNGATYLALY LSQFDFEGPA 50 

RAFLDVTANI IHEDFSGEKL GVAYEDDCMG PLSMNAGVFQ FDETNDDNTI 100 

ALNFRYPQGT DAKTIQTKLE KLNGVEKVTL SDHEHTPHYV PMDDELVSTL 150 

LAVYEKQTGL KGHEQVIGGG TFGRLLERGV AYGAMFPGDE NTMHQANEYM 200

PLENIFRSAA IYAEAIYELI K (SEQ ID NO: 23) 221 

FIG. 4b 



MTDLEKIIKA IKSDSQNQNY TENGIDPLFA APKTARINIV GQAPGLKTQE 50 

ARLYWKDKSG DRLRQWLGVD EETFYHSGKF AVLPLDFYYP GKGKSGDLPP 100 

RKGFAEKWHP LILKEMPNVQ LTLLVGQYAQ KYYLGSSAHK NLTETVKAYK 150 

DYLPDYLPLV HPSPRNQIWL KKNPWFEKDL IVDLQKIVAD ILKD 194 
(SEQ ID NO:24)

FIG. 4c 
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MRDNHLHTYF SYDCQTAFED YINGFTGEFI TTEHFDLSNP YTGQDDVPDY 50 

SAYCQKIDYL NQKYGNRFKK GIEIGYFKDR ESDILDYLKN KEFDLKLLSI 100 

HHNGRYDYLQ EEALKVPTKG AFSRLL (SEQ ID NO:25) 126 

FIG. 4d 



MKRKDLFGDK QTQYTIRKLS VGVASVTTGV CIFLHSPQVF AEEVSVSPAT 50 

TAIAESNINQ VDNQQSTNLK DDINSNSETV VTPSDMPDTK QLVSDETDTQ 100 

KGVTEPDKAT SLLEENKGPV SDKNTLDLKV APSTLQNTPD KTSQAIGAPS 150 

PTLKVANQAP RIENGYFRLH LKELPQGHPV ESTGLWIWGD VDQPSSNWPN 200 

GAIPMTDAKK DDYGYYVDFK LSEKQRKQIS FLINNKAGTN LSGDHHIPLL 250 

RPEMNQVWID EKYGIHTYQP LKEGYVRINY LSSSSNYDHL SAWLFKDVAT 300 

PSTTWPDGSN FVNQGLYGRY IDVSLKTNAK EIGFLILDES KTGDAVKVQP 350 

NDYVFRDLAN HNQIFVKDKD PKVYNNPYYI DQVQLKDAQQ IDLTSIQASF 400

TTLDGVDKTE ILKELKVTDK NQNAIQISDI TLDTSKSLLI IKGDFNPKQG 450 

HFNISYNGNN VMTRQSWEFK DQLYAYSGNL GAVLNQDGSK VEASLWSPSA 500 

DSVTMIIYDK DNQNRVVATT PLVKNNKGVW QTILDTKLGI KNYTGYYYLY 550 

EIKRGKDKVK ILDPYAKSLA EWDSNTVNDD IKTAKAAFVN PSQLGPQNLS 600

FAKIANFKGR QDAVIYEAHV RDFTSDRSLD GKLKNQFGTF AAFSEKLDYL 650

QKLGVTHIQL LPVLSYFYVN EMDKSRSTAY TSSDNNYNWG YDPQSYFALS 700

GMYSEKPKDP SARIAELKQL IHDIHKRGMG VILDVVYNHT AKTYLFEDIE 750

PNYYHFMNED GSPRESFGGG RLGTTHAMSR RVLVDSIKYL TSEFKVDGFR 800

FDMMGDHDAA AIELAYKEAK AINPNMIMIG EGWRTFQGDQ GQPVKPADQD 850 

WMKSTDTVGV FSDDIRNSLK SGFPNEGTPA FITGGPQSLQ GIFKNIKAQP 900 

GNFEADSPGD VVQYIAAHDN LTLHDVIAKS I (SEQ ID NO: 26) 931 



FIG. 4e 
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AATTCAAAGT TTGACAGAAG GTCAACTTCG TTCTGATATC CCTGAGTTCC GTGCTGGTGA 60 
I Q S LTEG QLR SDI PEFR AGD 

TACTGTACGT GTTCACGCTA AAGTTGTTGA AGGTACTCGC GAACGTATTC AGATCTTTGA 120 
TVR VHAK VVE GTR ERIQ I F E 

AGGTGTTGTT ATCTCACGTA AAGGTCAAGG AATCTCAGAA ATGTACACAG TACGTAAAAT 180 
GVV ISRK GQG ISE MYTV R K I 

TTCTGGTGGT ATCGGTGTAG AGCGTACATT CCCAATTCAC ACTCCTCGTG TTGATAAAAT 240
SGG IGVE RTF PIH TPRV DKI

CGAAGTTGTT CGTTATGGTA AAGTACGTCG TGCTAAACTT TACTACTTAC GCGCATTGCA 300
E V V RYGK VRR A K L YYLR ALQ 

AGGTAAAGCT GCACGTATTA AAGAAATCCG TCGTTAATTT TGATGATCAG ATTTTAAAAA 360 

TGCTTGGTTG TTTGAGGATA GTAACTATGT TTTAAAACTG GACAACCAAG ACGTAAAAAA 420 

TCTGCCTGTG GGCAGTTTTT TTACTAGGTC CCCTTAGTTC AATGGATATA ACAACTCCCT 480

. H I Y C S G

CCTAAGGAGT AATTGCTGGT TCGATTCCGG CAGGGGACAT ATTCATTGCA TGTAAATAGC 540
GLS YNST RNR CPV YENC TFL 

GGTTTAGAGC TATTTTGCCC CAAATTTCTC TGATTAAGTT TATCGTTCCT ATCTTTTTGT 600 
PKS SNQG LNR QNL KDNR DKQ 

TCTTGTAATT GATGTGCGTA AACTTCTAAA GTGATATTTA AATTCTCGTG ATCTAAAACT 660 
EQL QHAY VEL TIN LNEH DLV 

TGAGAGATGG AAATTAGATA GCTTGCAAAT GTATGCCTGA GAGAGTGCAC TCGTACCTCG 720 
Q S I SILY SAF THR LSHV RVE 

CGACCAGTTA TTTTTCGGAT AGTTTTATTG ACTGCATTAT TTGAAAGTTT GTCGAATAAT 780
RGT I K R I TKN VAN NSLK DFL

CTGTCGTTTT TATTTTTTGT AAATTCATGC AAAAAAAATA ATGTATCATT GTCAATTGGT 840
RDN KNKT FEH LFF L-TDN DIP 

ATATTTCTGA TACTACTTTT GTTTTTTGTT GGCAGGTATC TTTGGTTGAA ATGATAATCC 900 
I N R ISSK NKT PLY RQNF HYD 

CAAGTTTTAT TAATTGATAA ATATTTGTTA GTGTAATCAA TATCATTAAC TGTTAAACCT 960 
WTK NISL YKN TYD IDNV TLG 

AAACATTCAG CGAAGCGCAT GCCAGTTTTA GCGATGAGGT ATAACGCTGC ATACGATTGA 1020 
L C E A F R M 



TGTTGTGATT TTTCTTTACA AATTTTTATC AAGCGTAAGT ATTCATTGGT TTCAAGAAAT 1080 

TTTATCTCTA TTTACGCCCC TTATTTTTTG CTTTAACCTT AGTGAATAAA CAAAAATTTT 1140

TTTCTATATA TCCCTCGTGA ACAGCCATGG ATACGCAGGC TTTTACATGT ATGTTAAAAC 1200

GCTTTACTGT ATCTTGCACA TGCGTTTGAC TATAATGATT TATGACTTGT TGATATTTAG 1260
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TGGAAGTAAT ATTGCAAAGT AATATATTTC CTATTATATG TTTATACGAT ATTCGATATT 1320 

CCCACCCGTT GTCGCGTTTA CGGAAATACG CCATTGATAT ACTCCACATT AGCTAAAGAA 1380 

CAGGGTGTTC AAGGCTACCT TGATGGAAAA GGCTCTCTTA GAGATATTTG TAAATGGTAT 1440 

GATATCTCAA GTCGCTCTGT TCTCCAAAAG TGGATAAAAC GGTATACTAG TGGTGAAGAC 1500 

TTGAAAGCCA CTAGTAGAGG ATATAGCCGT ATGAAACAAG GAAGGCAAGC CACATTTGAA 1560 

GAACGTGTAG AGATTGTTAA CTACACCATT GCCCATGGGA AAGACTATCA AGCAGCTATT 1620 

GAGAAGTTTG GTGTTTCCTA CCAACAAATT TATTCTTGGG TGCGTAAGCT TGAGAAGAAT 1680 

GGCTCACAAG GTTTGGTTGA TAGACGTGTG AAAGGGTTGG AGAGTAGGCC TGATTTAACC 1740 

GAGATTGAGC AACTTTAACT CAAGATTAAA CAATTGGAGG AACGTAATCG TCTCTTAGAA 1800

ATCGAGGTTA GTTTACTAAA AAAGTTAGAA GACATCAAAC GAGGAAACAG ACGGTAAGAC 1860 

TAGGTAAGCA TTTAGCGGAG TTCCAAGTAA TCAAGAATTA TTACGATGAG GAATCTAATG 1920 

TGCCTATTCA GGCCTTATGC CAACTCTTGA AGGGGTCTCG TTCAGGCTAT TACAAGTGGC 1980 

TCAATCGTCA AAAAACAGAT TTTGAGACAA AAAATACAAA GCTAATGGCT AAAATCAAGG 2040 

AACTTCGTAG ACTCTACAAT GGTATCTTAG GTTATCGCCG TATGACAACA TTTATTAATC 2100 

GTCAACTTGG GACAACTTAA AACAAGAAAC GGATTCGTTG ATTGATGAAC ATTCTGGGGA 2160 

TTAGTTCAGT CATTCGTCGT GTTAGCCATG CTTGTACAAA AGCTGGTGAC AGATTTTACG 2220 

AAGAAAATAT TCTTAATCGT GAATTTACAG CCACAGCTCA TAACCAGAAA TGGTGCACAG 2280 

ATGTCACCTA TCTTCAATAC GGTCTGGGAG CTAAAGCTTA TCTCAGTGCG ATTAAAGACC 2340 

TGTATAACGG TTCTATTATC GCTTATGAGA TTAGTCACAA CAATGAAATC CACTTGTTAT 2400

GAAGACCATT AAAAAGGGGC TAGAGCTCAA TCCAGGAGCC ACACCTATCA TCCATAGCGA 2460

TTGAGGTAGT CAATATACTT CCAAAGAATA CCGTTATATC ATACAACAAG CTGGTCTGAC 2520

CTTATCCATG TCCCGGATTG GCAAATGTAT TGATAATGCA CCAACTGAAA GTTTCTTTGG 2580

GTTTTTCAAG ACTGAGTCTT ACCACCTTAA GAAATACAAC TCTTATGATG AGTTGGTCAA 2640

TGATGTGGCA CGTTATATCG AATTCTACAA CACACAACGT TATCAATCAA AATTAAACAA 2700

CCTGACTCCT CTAGAATTCA GGAATCAGGT TGCATAACTT ATCTTTTATT ATTTGACTGT 2760

CTACTTGACA GGGAGCCGTT CAGATTGCTT AACCTTTCTA AATTTGCTAA AATAGCTACA 2820

AGAAAACGAG CCATTTAATG CTTATTTCTT ATACTGTCTT GCCTCACGCT CTCCTCGACC 2880

AAAAATTGAG CGTGAGGCTT TTTGTTTCAT TAAACGATGA TATTTCCATA TTCATCAGTT 2940

TGTTTTCCGA GAGCCATCAA AGCTTCGATA AGGTCGATAA TTCCAGGAAT AAAGGTAATA 3000

CTAAAAATAA TATATAAAAA AACCTGGCCT ATTTTTCCTG CGTAAAATTT ATGCGCTCCA 3060 

ATGCCGCCCA AAAGAACGTT AATAAAACAT AAACTACTAT GTTAGCATAA GACTTTATTT 3120 
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TTACAACTGA ATTTCATATA AATGGATTAG AGTAAGGGAT AAAAGAAATT AGCATAGCTC 3180 

TTTTGAAAAT AAAAAAATTA ATATAATATG GAAAAAATTT TATTTCATAA ACGTTTCATA 3240

AAAGGTATGT AATCTAGTAT TTAGGCAACA CTATTTTGTC ACTGGTGTCT AGTAACTTAT 3300

AGATTGATAA TTTTACTAGT AAACGTAATT CTTCGCTTTA AGAGTTAAAT GTCTATTTAT 3360

TGTAAGCTAA ATTGGGAGGT GAACTTATGT AAAATTAGAT AGGTACTGTC AAGTACGGGA 3420

TGATTATTGA AACAGCCAGT ATGCATCATA AAATCTGTAT TGCTTAATAA CTATTTCCTT 3480

AACCAGACAT CAGTTCATTG TTTATCATCG CTACCCTAAG TCTAGTTTTT TCAATAGAGC 3540

ATTAGGTAGT TTTTGATAAT AAAACTATAT AAACATGAGA ATTAGATTTC GTATTGCATT 3600

CTTCATAATG AGTTATTTGA GATTTTCCTT TGAATAAATA GATACGAAAT TCAGTAACTT 3660

CATATATAAA CGGCTCTATC ATTGAGATAG TTTGTCAAAT GAAGAAATTT TTAATGGAAA 3720

TAGTTTTAAA AACATTAGTT GTAGGCGATG TAAAAATATT AATCCAGTGG ATGCAATAGT 3780

TGCGGAGTAA AAATAGAGAG GAGTAATTAG GAAGTGATAA AAAATGCTAT AGCATATATT 3840

ACCAGAAAAA AAAATAGAAC ACTTATTATA TTTGCTATTT TAACAATTGT TCTTTCTTGC 3900

TTGTATTCAT GTTTAACAAT AATGAAATCA AGTAATGAAA TAGAAAAGGC TTTATATGAA 3960

MKS SNEI EKA LYE 
| > 

AGTTCTAATT CTTCAATATC AATTACAAAA AAAGATGGTA AATATTTTAA TATTAATCAA 4020
SSNS SIS ITK KDGK YFN INQ

TTTAAGAATA TTGAAAAAAT AAAAGAGGTT GAAGAAAAAA TATTTCAATA TGATGGATTA 4080
FKNI EKI KEV EEKI FQY DGL

GCAAAATTGA AAGATCTTAA AGTAGTTAGT GGTGAGCAAA GTATAAATAG AGAAGATTTA 4140
AKLK DLK VVS GEQS INR EDL

TCTGACGAAT TTAAAAATGT TGTTTCACTA GAAGCTACAA GTAATACTAA AAGAAATCTT 4200
SDEF KNV VSL EATS NTK RNL

TTATTTAGTA GTGGAGTATT TAGTTTTAAA GAAGGAAAAA ATATAGAAGA AAATGATAAG 4260
LFSS GVF SFK EGKN IEE NDK

AATTCAATTC TTGTTCATGA AGAATTTGCT AAACAAAACA AACTAAAATT GGGTGATGAA 4320
NSIL VHE EFA KQNK LKL GDE

ATTGATCTTG AATTACTAGA TACGGAAAAA AGTGGAAAAA TAAAAAGTCA TAAATTTAAA 4380
IDLE LLD TEK SGKI KSH KFK

ATTATAGGAA TCTTTTCTGG TAAAAAACAG GAAACATATA CAGGATTATC ATCTGATTTT 4440
IIGI FSG KKQ ETYT GLS SDF

AGCGAAAATA TGGTTTTTGT AGATTATTCA ACTAGCCAAG AAATATTAAA TAAATCAGAG 4500
SENM VFV DYS TSQE ILN KSE

AATAATAGAA TTGCAAATAA AATTTTAATG TATTCTGGTA GTTTAGAATC TACAGAGCTT 4560
NNRI ANK ILM YSGS LES TEL

GCCTTAAACA AATTGAAAGA CTTTAAAATT GATAAGTCAA AGTATTCTAT TAAGAAAGAT 4620
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A L N K LKD FKI DKSK Y S I KKD 

AATAAAGCAT TCGAAGAGTC TTTAGAGTCA GTGAGTGGAA TAAAACATAT AATTAAAATA 4680
NKAF EES LES VSGI KHI IKI

ATGACTTATT CGATTATGTT AGGTGGAATA GTTGTTCTTT CATTAATCTT GATTCTATGG 4740
MTYS IML GGI VVLS LIL ILW

TTAAGAGAAA GAATTTATGA AATAGGTATA TTTTTATCTA TTGGAACAAC TAAGATACAA 4800
LRER IYE IGI FLSI GTT KIQ

ATTATAAGGC AATTTATATT TGAGTTAATA TTCATATCAA TACCAAGTAT AATATCCTCC 4860
IIRQ FIF ELI FISI PSI ISS

TTATTTTTAG GGAATCTACT ATTAAAAGTA ATTGTAGAAG GATTTATTAA CTCAGAGAAC 4920
LFLG NLL LKV IVEG FIN SEN

TCAATGATTT TCGGTGGAAG TTTAATAAAT AAAAGCAGTT TTATGTTAAA CATAACAACA 4980
SMIF GGS LIN KSSF MLN ITT

CTTGCAGAAA GTTATTTAAT ATTAATAAGT ATTATTGTTT TATCAGTTGT AATGGCCTCT 5040
LAES YLI LIS IIVL SVV MAS

TCATTAATAT TATTTAAGAA ACCACAAGAA ATATTATCAA AAATAAGTTA GGAGCAAATA 5100
SLIL FKK PQE ILSK IS.

ATGGATATAT TAGAAATAAA GAATGTAAAT TACAGTTACG CAAATTCTAA AGAAAAAGTT 5160
MDIL EIK NVN YSYA NSK EKV

TTGTCAGGAG TAAATCAAAA ATTTGAACTT GGAAAGTTTT ATGCGATAGT AGGGAAGTCA 5220
LSGV NQK FEL GKFY AIV GKS

GGAACAGGAA AATCCACACT TCTTTCCTTA CTTGCAGGAC TTGATAAAGT TCAAACAGGA 5280
GTGK STL LSL LAGL DKV QTG

AAAATCTTGT TTAAGAATGA AGATATAGAA AAGAAAGGAT ATAGTAATCA CAGAAAAAAT 5340
KILF KNE DIE KKGY SNH RKN

AATATATCTT TGGTATTTCA AAATTATAAT TTAATAGATT ATTTATCGCC GATTGAAAAT 5400
NISL VFQ NYN LIDY LSP IEN

ATTAGACTAG TAAATAAATC AGTAGATGAG AGTATCTTGT TCGAATTAGG TTTAGATAAA 5460
IRLV NKS VDE SILF ELG LDK

AAACAAATAA AAAGAAATGT TATGAAATTA TCTGGTGGTC AGCAACAAAG GGTAGCTATT 5520
KQIK RNV MKL SGGQ QQR VAI

GCTAGGGCAC TGGTATCAGA TGCCCCAATA ATACTAGCTG ATGAGCCTAC CGGTAACCTA 5580
ARAL VSD API ILAD EPT GNL

GACAGTGTTA CTGCTGGAGA AATAATT (SEQ ID NO:27)
DSVT AGE II

FIG. 5a 
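
The one-letter amino acid rows printed beneath the nucleotide rows of FIG. 5a can be checked against the DNA by reading it codon by codon. The following is a minimal sketch in Python, assuming the standard genetic code and a reading frame that starts at the first base of the row; the helper name `translate` is hypothetical and is not part of the disclosure. Stop codons are returned as `*`, whereas the figure prints a period.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, ignoring whitespace between blocks."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop a trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First nucleotide row of this part of FIG. 5a (ending at position 4260).
row = "TTATTTAGTA GTGGAGTATT TAGTTTTAAA GAAGGAAAAA ATATAGAAGA AAATGATAAG"
print(translate(row))
```

A row whose translation disagrees with the printed amino acids points to a scanning error in either the nucleotide or the amino acid line. Characters other than A, C, G and T raise a `KeyError` in this sketch.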
IQSLTEGQLR SDIPEFRAGD TVRVHAKVVE GTRERIQIFE GVVISRKGQG 50

ISEMYTVRKI SGGIGVERTF PIHTPRVDKI EVVRYGKVRR AKLYYLRALQ 100 

GKAARIKEIR R (SEQ ID NO:28) 111 

FIG. 5b 

MRFAECLGLT VNDIDYTNKY LSINKTWDYH FNQRYLPTKN KSSIRNIPID 50 

NDTLFFLHEF TKNKNDRLFD KLSNNAVNKT IRKITGREVR VHSLRHTFAS 100 

YLISISQVLD HENLNITLEV YAHQLQEQKD RNDKLNQRNL GQNSSKPLFT 150 

CNEYVPCRNR TSNYSLGGSC YIH (SEQ ID NO: 29) 173 

FIG. 5c 

MKSSNEIEKA LYESSNSSIS ITKKDGKYFN INQFKNIEKI KEVEEKIFQY 50 

DGLAKLKDLK VVSGEQSINR EDLSDEFKNV VSLEATSNTK RNLLFSSGVF 100 

SFKEGKNIEE NDKNSILVHE EFAKQNKLKL GDEIDLELLD TEKSGKIKSH 150 

KFKIIGIFSG KKQETYTGLS SDFSENMVFV DYSTSQEILN KSENNRIANK 200 

ILMYSGSLES TELALNKLKD FKIDKSKYSI KKDNKAFEES LESVSGIKHI 250 

IKIMTYSIML GGIVVLSLIL ILWLRERIYE IGIFLSIGTT KIQIIRQFIF 300 

ELIFISIPSI ISSLFLGNLL LKVIVEGFIN SENSMIFGGS LINKSSFMLN 350

ITTLAESYLI LISIIVLSVV MASSLILFKK PQEILSKIS 389 
(SEQ ID NO:30) 

FIG. 5d 

MDILEIKNVN YSYANSKEKV LSGVNQKFEL GKFYAIVGKS GTGKSTLLSL 50 

LAGLDKVQTG KILFKNEDIE KKGYSNHRKN NISLVFQNYN LIDYLSPIEN 100 

IRLVNKSVDE SILFELGLDK KQIKRNVMKL SGGQQQRVAI ARALVSDAPI 150 

ILADEPTGNL DSVTAGEII (SEQ ID NO: 31) 169 

FIG. 5e 
CATATGACAA TATTTTTGAA AGTCTACATC ACTTACTCGC CTGTCGTGGA AAATCTGGCA 60
ATACATTAAT CGACCAATTA GTTGCTGATG GTTTACTTCA TGCAGATAAT CACTACCATT 120
TTTTCAATGG GAAGTCTCTG GCCACTTTCA ATACTAACCA ATTGATTCGC GAAGTTGTCT 180
ATGTTGAAAT ATCCTTAGAT ACTATGTCTA GTGGTGAACA TGATTTAGTA AAAGTTAACA 240
TTATCAGACC CACTACCGAG CATACTATCC CCACGATGAT GACAGCTAGC CCCTATCATC 300
AAGGTATCAA TGATCCTGCC GCAGACCAAA AAACATACCA AATGGAGGGT GCGCTAGCAG 360
TTAAACAGCC TAAACACATA CAAGTTGACA CAAAACCATT TAAAGAAGAA GTAAAACATC 420
CTTCAAAATT ACCCATCAGC CCTGCAACTG AAAGCTTCAC ACACATTGAC AGTTATAGTC 480
TCAATGACTA TTTTCTTTCT CGTGGTTTTG CTAATATATA CGTTTCAGGT GTGGGTACTG 540
CTGGCTCTAC GGGTTTCATG ACCAGTGGGG ATTACCAACA AATACAAAGC TTTAAAGCAG 600
TCATTGATTG GTTAAATGGT AAGGTTACTG CATTCACAAG TCATAAACGA GATAAACAAG 660
TCAAGGCTGA TTGGTCAAAC GGCCTTGTAG CAACCACAGG TAAATCTTAT CTCGGTACCA 720
TGTCAACTGG TTTAGCAACA ACTGGCGTTG AGGGGCTGAA AGTCATTATC GCTGAAGCCG 780
CAATCTCCAC ATGGTATGAT TATTATCGAG AAAATGGGCT TGTGTGTAGT CCAGGCGGCT 840
ACCCCGGTGA AGATTTAGAC GTTTTAACAG AATTAACATA CTCACGAAAC CTCTTAGCTG 900
GTGATTACAT CAAAAACAAC GATTGCTATC AAGCATTGTT AAATGAACAA TCAAAAGCAA 960
TTGACCGTCA AAGTGGGGAT TACAACCAAT ACTGGCATGA CCGTAATTAC CTAACTCACG 1020
TCAATAATGT CAAAAGTCGA GTAGTTTACA CTCATGGACT ACAGGATTGG AATGTTAAGC 1080
CAAGACATGT CTACAAAGTT TTCAATGCAT TGCCTCAAAC CATCAAAAAA CACCTTTTTT 1140
TACATCAAGG TCAACATGTG TATATGCATA ATTGGCAGTC GATTGATTTT CGTGAAAGCA 1200
TGAATGCCTT ACTAAGCCAA GAACTACTTG GCATTGACAA TCATTTCCAA TTAGAAGAGG 1260
TCATTTGGCA AGATAATACT ACTGAGCAAA CTTGGCAAGT TTTAGATGCT TTCGGAGGAA 1320
ACCATCAAGA GCAAATTGGT TTAGGTGATA GTAAAAAACT TATTGATAAC CATTATGACA 1380
AAGAAGCCTT TGATACTTAT TGTAAAGACT TCAATGTGTT CAAAAATGAT CTTTTCAAGG 1440
GAAATAATAA AACCAATCAA ATCACTATTA ATCTTCCTCT AAAGAAAAAT TATCTCCTGA 1500
ATGGACAGTG CAAACTCCAT CTACGTGTTA AAACTAGTGA CAAAAAGGCC ATTTTATCAG 1560
CCCAAATCTT AGACTATGGT CCTAAAAAAC GATTCAAAGA TACACCAACC ATCAAATTCT 1620
TAAACAGCCT TGATAATGGT AAAAATTTTG CCAGAGAAGC TTTACGTGAA CTCCCGTTTA 1680
CTAAAGATCA TTATCGTGTC ATCAGTAAAG GTGTCTTGAA CCTTCAAAAT CGTACAGACT 1740
TACTTACAAT TGAGGCTATC GAGCCAGAAC AATGGTTTGA TATCGAGTTT AGCCTCCAAC 1800
CAAGTATATA TCAATTGAGT AAAGGTGATA ATCTAAGGAT TATCCTTTAT ACAACTGATT 1860
TTGAACATAC CATTCGAGAT AATGCTAGTT ACTCTATAAC AGTAGATTTG AGTCAATCTT 1920
ATTTAACTAT CCCAACTAAT CAAGGAAATT AACTTATGAA ACTTCTTACT AAAGAACGGT 1980
TTGATGATTC TCAACACTTT TGGTACCAGA TCAATTTATT ACAAGAGAGT AACTTCGGAG 2040
CAGTTTTTGA CCATGATAAT AAAAACATTC CACAGGTTGT TGCAACTATT GTTGATGATT 2100
TACAAGGTTC CGGAAGTTCG AATCATTTCT GGTATTTTGG CAATACTACT GATACTTCCA 2160
TCCTTATGAT TGCTCATTTA AATCGAAAAT TCTATATTCA GGTTAATTTA AAGGACTTTG 2220
ACTTTGCACT CAATTTAATA GCTATAAATA ATTGGAAGAG TCTCCTCCAA ACTCAACTTG 2280
AAGCTCTAAA CGATACCCTA GCAATATTTC AATAAATAAG GTAGAATGGA GTGACAAAGC 2340
AACGCGAGGG AGACTGATTA ATGTCATCTT ATTGGAATAA CTATCCTGAA CTTAAAAAAA 2400
ATATTGATGA AACCAATCAA CTAATTCAAG AAAGAATACA GGTCAGAAAT AAAGATATTG 2460
AAGCGGCGCT AAGCCAACTC ACAGCTGCGG GAGGAAAACA GCTCAGACCA GCATTCTTTT 2520 
ACCTTTTTTC TCAACTTGGT AATAAGGAGA ATCAAGATAC TCAGCAACTA AAGAAAATCG 2580 
CTGCTTCTTT AGAAATCCTT CACGTTGCTA CATTAATCCA TGATGATGTC ATTGATGACT 2640 
CACCACTAAG ACGTGGAAAT ATGACCATTC AAAGCAAGTT TGGCAAAGAC ATCGCAGTTT 2700 
ATACTGGGGA TTTACTTTTC ACAGTCTTTT TCGATCTTAT TTTAGAATCT ATGACTGATA 2760
CACCATTTAT GAGGATTAAT GCAAAATCTA TGCGTAAAAT TCTCATGGGA GAATTGGACC 2820 
AGATGCACCT TCGTTACAAT CAACAACAAG GTATCCATCA CTATTTACGT GCGATTTCAG 2880 
GTAAGACAGC CGAACTCTTT AAATTAGCTA GCAAAGAAGG AGCTTACTTT GGTGGTGCAG 2940 
AGAAGGAGGT TGTTCGTCTA GCAGGCCATA TCGGCTTTAA CATTGGTATG ACATTCCAAA 3000 
TTTTGGATGA TATCCTGGAT TATACTGCAG ATAAAAAAAC ATTTAATAAG CCTGTCTTAG 3060 
AGGATTTAAC ACAAGGCGTT TACAGCCTTC CTCTACTTCT TGCCATTGAA GAAAATCCTG 3120 
ATATTTTCAA ACCTATTTTA GATAAAAAAA CAGATATGGC TACTGAAGAC ATGGAAAAAA 3180 
TTGCTTATCT CGTCGTTTCC CATAGAGGTG TTGACAAAGC TCGCCATCTA GCTCGTAAAT 3240 
TTACTGAGAA AGCTATTAGT GACATAAATA AGCTACCCCA GAACTCTGCA AAAAAACAGT 3300 
TGCTACAATT AACTAATTAC CTTTTAAAAC GCAAAATTTA AATAATAAAA AAACATTCCA 3360 
CAATGCTAGA AAAGCAGTTA GGGAATGTTT TTTTATTATC ATTTATTTAT CGCACCTATC 3420 
AATCATCATA GATCACCATC ATCAGCGGCT TTCAGCTGAC GGTAACGTTG ACTACTTTGA 3480
GACAATTCTT GAGGAGAACC TTCCAACTCT AATTGCCCAT TTTCTATAAA TAAGATACGA 3540
TCAGCATGTT CAATACCTTT TAAGTGATGT GTAATCCAAA CTAAGGTCTT ACCTTCCAAT 3600 
TCTTTCATAA ATACCCTTAG TAAGGCTTGT TCAGTAATAG GATCAAGTCC AACAGTTGGC 3660 
TCATCTAAGA TAACAATTGG GACATCTTTT AGTAAGATTC TAGCCAAAGC AATTCTATGC 3720 
CTTTCGCCAC CTGAAAACCT AAGTCCAGCT TCATCAACCA TTGTATAGAG ACCATCTGAT 3780 
AAATCAGTGA CCATCTCTTT CAATCCAACT CGTTCAAGAA CTTTCCATAC ATCTTCTTCA 3840
CTAGCATCTT GGTTTCCAAT GCGAATGTTA TTTAGCAGGG TTGTATTAAA AAGGTAGGGC 3900 
GCTTGTTGTA TCACTCCAAT ATAGTTAGAA ATGCAATCAC CAACTATTGA AACATCAGCA 3960
CCGCCTAGGG TAATCTTCCC TTGACTTGCT TTCAAGTCGC CACGAAGTAG ACTAGCTAAG 4020
GTACTCTTGC CAGAACCACT CCGCCCTAAA ATAGCAATTT TTTCTCCTTC TTTAATATCC 4080
AAATCTAAAT GATGCAAAAC CCATTTCTCT TGTGGCTTAT ACTGGAAACT TAAATTCTTG 4140 
ACGGAAAAAT CATATGGCTT ATTAGGCAAT T (SEQ ID NO: 32) 4171 

FIG. 6a 
YDNIFQSLHH LLACRGKSGN TLIDQLVADG LLHADNHYHF FNGKSLATFN 50
TNQLIREVVY VEISLDTMSS GEHDLVKVNI IRPTTEHTIP TMMTASPYHQ 100
GINDPAADQK TYQMEGALAV KQPKHIQVDT KPFKEEVKHP SKLPISPATE 150
SFTHIDSYSL NDYFLSRGFA NIYVSGVGTA GSTGFMTSGD YQQIQSFKAV 200
IDWLNGKVTA FTSHKRDKQV KADWSNGLVA TTGKSYLGTM STGLATTGVE 250
GLKVIIAEAA ISTWYDYYRE NGLVCSPGGY PGEDLDVLTE LTYSRNLLAG 300
DYIKNNDCYQ ALLNEQSKAI DRQSGDYNQY WHDRNYLTHV NNVKSRVVYT 350
HGLQDWNVKP RHVYKVFNAL PQTIKKHLFL HQGQHVYMHN WQSIDFRESM 400
NALLSQELLG IDNHFQLEEV IWQDNTTEQT WQVLDAFGGN HQEQIGLGDS 450
KKLIDNHYDK EAFDTYCKDF NVFKNDLFKG NNKTNQITIN LPLKKNYLLN 500
GQCKLHLRVK TSDKKAILSA QILDYGPKKR FKDTPTIKFL NSLDNGKNFA 550
REALRELPFT KDHYRVISKG VLNLQNRTDL LTIEAIEPEQ WFDIEFSLQP 600
SIYQLSKGDN LRIILYTTDF EHTIRDNASY SITVDLSQSY LTIPTNQGN 649
(SEQ ID NO:33)

FIG. 6b



MKLLTKERFD DSQHFWYQIN LLQESNFGAV FDHDNKNIPQ VVATIVDDLQ 50
GSGSSNHFWY FGNTTDTSIL MIAHLNRKFY IQVNLKDFDF ALNLIAINNW 100
KSLLQTQLEA LNDTLAIFQ (SEQ ID NO:34) 119

FIG. 6c

MSSYWNNYPE LKKNIDETNQ LIQERIQVRN KDIEAALSQL TAAGGKQLRP 50
AFFYLFSQLG NKENQDTQQL KKIAASLEIL HVATLIHDDV IDDSPLRRGN 100
MTIQSKFGKD IAVYTGDLLF TVFFDLILES MTDTPFMRIN AKSMRKILMG 150
ELDQMHLRYN QQQGIHHYLR AISGKTAELF KLASKEGAYF GGAEKEVVRL 200
AGHIGFNIGM TFQILDDILD YTADKKTFNK PVLEDLTQGV YSLPLLLAIE 250
ENPDIFKPIL DKKTDMATED MEKIAYLVVS HRGVDKARHL ARKFTEKAIS 300
DINKLPQNSA KKQLLQLTNY LLKRKI (SEQ ID NO:35) 326

FIG. 6d
LPNKPYDFSV KNLSFQYKPQ EKWVLHHLDL DIKEGEKIAI LGRSGSGKST 50

LASLLRGDLK ASQGKITLGG ADVSIVGDCI SNYIGVIQQA PYLFNTTLLN 100 

NIRIGNQDAS EEDVWKVLER VGLKEMVTDL SDGLYTMVDE AGLRFSGGER 150 

HRIALARILL KDVPIVILDE PTVGLDPITE QALLRVFMKE LEGKTLVWIT 200 

HHLKGIEHAD RILFIENGQL ELEGSPQELS QSSQRYRQLK AADDGDL 247 
(SEQ ID NO:36) 

FIG. 6e 
AGGCAAGTTC CTTAGTTGTT 60
CGGATACTTA TGTCCGTCCA 120
CTGGACATTG TGGGGTGGAT 180
ATGGTACTGT GAAATTTGCA 240
GAAATTGTGT CATGATTCAA 300
GTGTGGTGGC TAGGACTGGG 360
CAACTGGTAT GGCGACGGGA 420
TTCAAAATGG TTTCCATGGA 480
TTAGTGGAAA AACGCAAGCA 540
AGAATCAATC TAGTAAATTA 600
TTTGGTTAGT CAAAAATAAC 660
TACCAGCATC AGAAATTGAT 720
TTCAAAAAGG TGGTTACTTT 780
TCCAAGGAAC AGCTGGTTTA 840
GGCTTCGCGT TGACAACAGT 900
GTTTTAAATG ACAGTTTTGT 960



AAAAT AA TTTTATAGTC CAGTAAAGTG 1020
TGATATTATA GTCTCGGACT AATAAAAAGG AAATAGGAAT TGAAGCAATG AAAATGAATA 1080
AAAAGGTACT ATTGACATCG ACAATGGCAG CTTCGCTATT ATCAGTCGCA AGTGTTCAAG 1140
CACAAGAAAC AGATACGACG TGGACAGCAC GTACTGTTTC AGAGGTAAAG GCTGATTTGG 1200
TAAAGCAAGA CAATAAATCA TCATATACTG TGAAATATGG TGATACACTA AGCGTTATTT 1260
CAGAAGCAAT GTCAATTGAT ATGAATGTCT TAGCAAAAAT TAATAACATT GCAGATATCA 1320
ATCTTATTTA TCCTGAGACA ACACTGACAG TAACTTACGA TCAGAAGAGT CATACTGCCA 1380
CTTCAATGAA AATAGAAACA CCAGCAACAA ATGCTGCTGG TCAAACAACA GCTACTGTGG 1440
ATTTGAAAAC CAATCAAGTT TCTGTTGCAG ACCAAAAAGT TTCTCTCAAT ACAATTTCGG 1500
AAGGTATGAC ACCAGAAGCA GCAACAACGA TTGTTTCGCC AATGAAGACA TATTCTTCTG 1560
CGCCAGCTTT GAAATCAAAA GAAGTATTAG CACAAGAGCA AGCTGTTAGT CAAGCAGCAG 1620
CTAATGAACA GGTATCAACA GCTCCTGTGA AGTCGATTAC TTCAGAAGTT CCAGCAGCTA 1680
AAGAGGAAGT TAAACCAACT CAGACGTCAG TCAGTCAGTC AACAACAGTA TCACCAGCTT 1740
CTGTTGCCGC TGAAACACCA GCTCCAGTAG CTAAAGTAGC ACCGGTAAGA ACTGTAGCAG 1800
CCCCTAGAGT GGCAAGTGTT AAAGTAGTCA CTCCTAAAGT AGAAACTGGT GCATCACCAG 1860
AGCATGTATC AGCTCCAGCA GTTCCTGTGA CTACGACTTC AACAGCTACA GACAGTAAGT 1920
TACAAGCGAC TGAAGTTAAG AGCGTTCCGG TAGCACAAAA AGCTCCAACA GCAACACCGG 1980
TAGCACAACC AGCTTCAACA ACAAATGCAG TAGCTGCACA TCCTGAAAAT GCAGGGCTCC 2040
AACCTCATGT TGCAGCTTAT AAAGAAAAAG TAGCGTCAAC TTATGGAGTT AATGAATTCA 2100
GTACATACCG TGCAGGTGAT CCAGGTGATC ATGGTAAAGG TTTAGCAGTC GACTTTATTG 2160
TAGGTAAAAA CCAAGCACTT GGTAATGAAG TTGCACAGTA CTCTACACAA AATATGGCAG 2220
CAAATAACAT TTCATATGTT ATCTGGCAAC AAAAGTTTTA CTCAAATACA AATAGTATTT 2280
ATGGACCTGC TAATACTTGG AATGCAATGC CAGATCGTGG TGGCGTTACT GCCAACCATT 2340
ATGACCATGT TCACGTATCA TTTAACAAAT AATATAAAAA AGGAAGCTAT TTGGCTTCTT 2400
TTTTATATGC CTTGAATAGA CTTTCAAGGT TCTTATCTAA TTTTTATTAA ATTGAGGAGA 2460
TTAAGCTATA AGTCTGAAAC TACTTTCACG TTAACCGTGA CTAAATCAAA ACGTTAAAAC 2520
TAAAATCTAA GTCTGTAAAG ATTATTGAAA ACGCTTTAAA AACAGATATA ATAAGGTTTG 2580
TAGATATCTA AAATTAAAAA AGATAAGGAA GTGAGAATAT GCCACATCTA AGTAAAGAAG 2640
CTTTTAAAAA GCAAATAAAA AATGGCATTA TTGTGTCATG TCAAGCTTTG CCTGGGGAGC 2700
CTCTTTATAC TGAAAGTGGA GGTGTTATGC CTCTTTTAGC TTTGGCAGCT CAAGAAGCAG 2760
GAGCGGTTGG TATAAGAGCC AATAGTGTCC GCGACATTAA GGAAATTCAA GAAGTTACTA 2820
ATTTACCTAT CATCGGCATT ATTAAACGTG AATATCCTCC ACAAGAACCA TTTATCACTG 2880
CTACGATGAC AGAGGTGGAT CAATTAGCTA GTTTAGATAT TGCAGTAATA GCCTTAGATT 2940
GTACACTTAG AGAGCGTCAT GATGGTTTGA GTGTAGCTGA GTTTATTCAA AAGATAAAAG 3000
GGAAATATCC TGAACAGTTG CTAATGGCTG ATATAAGTAC TTTTGAAGAA GGTAAAAATG 3060
CTTTTGAAGC AGGAGTTGAT TTTGTGGGTA CAACTCTATC TGGATACACA GATTACAGCC 3120
GCCAAGAAGA AGGACCGGAT ATAGAACTCC TTAATAAGCT TTGTCAAGCC GGTATAGATG 3180
TGATTGCGGA AGGTAAAATT CATACTCCTA AGCAAGCTAA TGAAATTAAT CATATAGGTG 3240
TTGCAGGAAT TGTAGTTGGT GGTGCTATCA CTAGACCAAA AGAAATAGCG GAGCGTTTCA 3300
TCTCAGGACT TAGTTAAAAG TGTTACTCAA AAATCAAAAT CAAAATAAAA AAGGGGAATA 3360
GTTATGAGTA TCAAAAAAAG TGTGATTGGT TTTTGCCTCG GAGCTGCAGC ATTATCAATG 3420
TTTGCTTGTG TAGACAGTAG TCAATCTGTT ATGGCTGCCG AGAAGGATAA AGTCGAAATT 3480
(SEQ ID NO:37)

FIG. 7a



NSIWRFFLNK WLVKASSLVV LGGMVLSAGS RVLADTYVRP IDNGRITTGF 50 

NGYPGHCGVD YAVPTGTIIR AVADGTVKFA GAGANFSWMT DLAGNCVMIQ 100 

HADGMHSGYA HMSRVVARTG EKVKQGDIIG YVGATGMATG PHLHFEFLPA 150 

NPNFQNGFHG RINPTSLIAN VATFSGKTQA SAPSIKPLQS APVQNQSSKL 200 

KVYRVDELQK VNGVWLVKNN TLTPTGFDWN DNGIPASEID EVDANGNLTA 250 

DQVLQKGGYF IFNPKTLKTV EKPIQGTAGL TWAKTRFANG SSVWLRVDNS 300 

QELLYK (SEQ ID NO: 38) 306 

FIG. 7b 
MKMNKKVLLT STMAASLLSV ASVQAQETDT TWTARTVSEV KADLVKQDNK 50
SSYTVKYGDT LSVISEAMSI DMNVLAKINN IADINLIYPE TTLTVTYDQK 100
SHTATSMKIE TPATNAAGQT TATVDLKTNQ VSVADQKVSL NTISEGMTPE 150
AATTIVSPMK TYSSAPALKS KEVLAQEQAV SQAAANEQVS TAPVKSITSE 200
VPAAKEEVKP TQTSVSQSTT VSPASVAAET PAPVAKVAPV RTVAAPRVAS 250
VKVVTPKVET GASPEHVSAP AVPVTTTSTA TDSKLQATEV KSVPVAQKAP 300
TATPVAQPAS TTNAVAAHPE NAGLQPHVAA YKEKVASTYG VNEFSTYRAG 350
DPGDHGKGLA VDFIVGKNQA LGNEVAQYST QNMAANNISY VIWQQKFYSN 400
TNSIYGPANT WNAMPDRGGV TANHYDHVHV SFNK (SEQ ID NO: 39) 434

FIG. 7c

MPHLSKEAFK KQIKNGIIVS CQALPGEPLY TESGGVMPLL ALAAQEAGAV 50
GIRANSVRDI KEIQEVTNLP IIGIIKREYP PQEPFITATM TEVDQLASLD 100
IAVIALDCTL RERHDGLSVA EFIQKIKGKY PEQLLMADIS TFEEGKNAFE 150
AGVDFVGTTL SGYTDYXRQE EGPDIELLNK LCQAGIDVIA EGKIHTPKQA 200
NEINHIGVAG IVVGGAITRP KEIAERFISG LS (SEQ ID NO: 40) 232

FIG. 7d

MSIKKSVIGF CLGAAALSMF ACVDSSQSVM AAEKDKVEI 39
(SEQ ID NO:41)

FIG. 7e
ATGAAAATGA ATAAAAAGGT ACTATTGACA TCGACAATGG CAGCTTCGCT 50
ATTATCAGTC GCAAGTGTTC AAGCACAAGA AACAGATACG ACGTGGACAG 100
CACGTACTGT TTCAGAGGTA AAGGCTGATT TGGTAAAGCA AGACAATAAA 150
TCATCATATA CTGTGAAATA TGGTGATACA CTAAGCGTTA TTTCAGAAGC 200
AATGTCAATT GATATGAATG TCTTAGCAAA AATTAATAAC ATTGCAGATA 250
TCAATCTTAT TTATCCTGAG ACAACACTGA CAGTAACTTA CGATCAGAAG 300
AGTCATACTG CCACTTCAAT GAAAATAGAA ACACCAGCAA CAAATGCTGC 350
TGGTCAAACA ACAGCTACTG TGGATTTGAA AACCAATCAA GTTTCTGTTG 400
CAGACCAAAA AGTTTCTCTC AATACAATTT CGGAAGGTAT GACACCAGAA 450
GCAGCAACAA CGATTGTTTC GCCAATGAAG ACATATTCTT CTGCGCCAGC 500
TTTGAAATCA AAAGAAGTAT TAGCACAAGA GCAAGCTGTT AGTCAAGCAG 550
CAGCTAATGA ACAGGTATCA ACAGCTCCTG TGAAGTCGAT TACTTCAGAA 600
GTTCCAGCAG CTAAAGAGGA AGTTAAACCA ACTCAGACGT CAGTCAGTCA 650
GTCAACAACA GTATCACCAG CTTCTGTTGC CGCTGAAACA CCAGCTCCAG 700
TAGCTAAAGT AGCACCGGTA AGAACTGTAG CAGCCCCTAG AGTGGCAAGT 750
GTTAAAGTAG TCACTCCTAA AGTAGAAACT GGTGCATCAC CAGAGCATGT 800
ATCAGCTCCA GCAGTTCCTG TGACTACGAC TTCAACAGCT ACAGACAGTA 850
AGTTACAAGC GACTGAAGTT AAGAGCGTTC CGGTAGCACA AAAAGCTCCA 900
ACAGCAACAC CGGTAGCACA ACCAGCTTCA ACAACAAATG CAGTAGCTGC 950
ACATCCTGAA AATGCAGGGC TCCAACCTCA TGTTGCAGCT TATAAAGAAA 1000
AAGTAGCGTC AACTTATGGA GTTAATGAAT TCAGTACATA CCGTGCAGGT 1050
GATCCAGGTG ATCATGGTAA AGGTTTAGCA GTCGACTTTA TTGTAGGTAA 1100
AAACCAAGCA CTTGGTAATG AAGTTGCACA GTACTCTACA CAAAATATGG 1150
CAGCAAATAA CATTTCATAT GTTATCTGGC AACAAAAGTT TTACTCAAAT 1200
ACAAATAGTA TTTATGGACC TGCTAATACT TGGAATGCAA TGCCAGATCG 1250
TGGTGGCGTT ACTGCCAACC ATTATGACCA TGTTCACGTA TCATTTAACA 1300
AATAA 1305
(SEQ ID NO:42)

FIG.
CAAGAAACAG ATACGACGTG GACAGCACGT ACTGTTTCAG AGGTAAAGGC 50
TGATTTGGTA AAGCAAGACA ATAAATCATC ATATACTGTG AAATATGGTG 100
ATACACTAAG CGTTATTTCA GAAGCAATGT CAATTGATAT GAATGTCTTA 150
GCAAAAATTA ATAACATTGC AGATATCAAT CTTATTTATC CTGAGACAAC 200
ACTGACAGTA ACTTACGATC AGAAGAGTCA TACTGCCACT TCAATGAAAA 250
TAGAAACACC AGCAACAAAT GCTGCTGGTC AAACAACAGC TACTGTGGAT 300
TTGAAAACCA ATCAAGTTTC TGTTGCAGAC CAAAAAGTTT CTCTCAATAC 350
AATTTCGGAA GGTATGACAC CAGAAGCAGC AACAACGATT GTTTCGCCAA 400
TGAAGACATA TTCTTCTGCG CCAGCTTTGA AATCAAAAGA AGTATTAGCA 450
CAAGAGCAAG CTGTTAGTCA AGCAGCAGCT AATGAACAGG TATCAACAGC 500
TCCTGTGAAG TCGATTACTT CAGAAGTTCC AGCAGCTAAA GAGGAAGTTA 550
AACCAACTCA GACGTCAGTC AGTCAGTCAA CAACAGTATC ACCAGCTTCT 600
GTTGCCGCTG AAACACCAGC TCCAGTAGCT AAAGTAGCAC CGGTAAGAAC 650
TGTAGCAGCC CCTAGAGTGG CAAGTGTTAA AGTAGTCACT CCTAAAGTAG 700
AAACTGGTGC ATCACCAGAG CATGTATCAG CTCCAGCAGT TCCTGTGACT 750
ACGACTTCAA CAGCTACAGA CAGTAAGTTA CAAGCGACTG AAGTTAAGAG 800
CGTTCCGGTA GCACAAAAAG CTCCAACAGC AACACCGGTA GCACAACCAG 850
CTTCAACAAC AAATGCAGTA GCTGCACATC CTGAAAATGC AGGGCTCCAA 900
CCTCATGTTG CAGCTTATAA AGAAAAAGTA GCGTCAACTT ATGGAGTTAA 950
TGAATTCAGT ACATACCGTG CAGGTGATCC AGGTGATCAT GGTAAAGGTT 1000
TAGCAGTCGA CTTTATTGTA GGTAAAAACC AAGCACTTGG TAATGAAGTT 1050
GCACAGTACT CTACACAAAA TATGGCAGCA AATAACATTT CATATGTTAT 1100
CTGGCAACAA AAGTTTTACT CAAATACAAA TAGTATTTAT GGACCTGCTA 1150
ATACTTGGAA TGCAATGCCA GATCGTGGTG GCGTTACTGC CAACCATTAT 1200
GACCATGTTC ACGTATCATT TAACAAATAA 1230
(SEQ ID

FIG. 9



QETDTTWTAR TVSEVKADLV KQDNKSSYTV KYGDTLSVIS EAMSIDMNVL 50
AKINNIADIN LIYPETTLTV TYDQKSHTAT SMKIETPATN AAGQTTATVD 100
LKTNQVSVAD QKVSLNTISE GMTPEAATTI VSPMKTYSSA PALKSKEVLA 150
QEQAVSQAAA NEQVSTAPVK SITSEVPAAK EEVKPTQTSV SQSTTVSPAS 200
VAAETPAPVA KVAPVRTVAA PRVASVKVVT PKVETGASPE HVSAPAVPVT 250
TTSTATDSKL QATEVKSVPV AQKAPTATPV AQPASTTNAV AAHPENAGLQ 300
PHVAAYKEKV ASTYGVNEFS TYRAGDPGDH GKGLAVDFIV GKNQALGNEV 350
AQYSTQNMAA NNISYVIWQQ KFYSNTNSIY GPANTWNAMP DRGGVTANHY 400
DHVHVSFNK (SEQ ID NO: 44) 409

FIG. 9a
[Figure: four panels titled "Experience MB-14, NCS 954 (type III)", "Experience MB-20, NCS 535 (type V)", "Experience MB-16, ATCC 12401 (type Ib)" and "Experience MB-21, NCS 9842 (type VI)". Horizontal axis: post-1st, post-2nd, post-3rd. Vertical axis: powers of ten.]
SEQUENCE LISTING

<110> BioChem Vaccins
RIOUX, Clement
DENIS, Martin
BRODEUR, Bernard R.
HAMEL, Josee
CHARLEBOIS, Isabelle
BOYER, Martine

<120> NOVEL GROUP B STREPTOCOCCUS ANTIGENS

<130> 12806-9PCT

<150> 60/075,425
<151> 1998-02-20

<160> 44

<170> FastSEQ for Windows Version 3.0

<210> 1
<211> 4514
<212> DNA
<213> Streptococcus

<220>
<221> CDS
<222> (3)...(464)
<221> CDS
<222> (534)...(887)
<223>

<221> CDS
<222> (1024)...(1767)
<221> CDS
<222> (1841)...(4288)
<221> CDS
<222> (2735)...(4288)
<400> 1

ta tct ggc aaa gag cca gct aat cgt ttt agt tgg gct aaa aat aaa 47
Ser Gly Lys Glu Pro Ala Asn Arg Phe Ser Trp Ala Lys Asn Lys
1 5 10 15
tta tta atc aat gga ttc att gca act cta gca gca act atc tta ttt 95
Leu Leu Ile Asn Gly Phe Ile Ala Thr Leu Ala Ala Thr Ile Leu Phe
20 25 30



ttc ata ggt etc aaa cca gat tac cct gga aaa acc 



t-t-t- ara att caa ttc ata ggt t-uu ^aa y"- 

Phe La Val Gin Phe He Gly Leu Lys Pro Asp Tyr Pro Gly Lys Thr 

35 40 45 

tac ttt att ate eta ttg aca gca tgg act ttg atg gca tta gta act 
Tyr lie lie lie Leu Leu Thr Ala Trp Thr Leu Met Ala Leu Val Thr 
50 55 60 

act tta qtq gga tgg gat aat agg tat ggt tec ttc ttg teg tta tta 
111 Leu fa? G?y Trp Lp Asn Arg Tyr Gly Sar Phe Leu Ser Leu Leu 
65 70 75 

ata tta tta ttc cag ctt ggt tea age gca gga act tac cca ata gaa 
Tie leu Leu Phe Gin Leu Gly Ser Ser Ala Gly Thr Tyr Pro lie Glu 

tta aat cct aag ttc ttt caa aca att eaa cca ttt tta ccg atg act 
HI sir Pro Lys Phe Phe Gin Thr He Gin Pro Phe Leu Pro Met Thr 

100 105 110 

tac tct gtt tea gga tta aga gag acc ate teg ttg acg gga gac gtt 
Tyr til Val Ser Gly Leu Arg Glu Thr He Ser Leu Thr Gly Asp Val 

115 120 125 

aac cat caa tgg aga atg eta gta ate ttt tta gta tea teg atg ata 
111 Til Gin Trp Arg Met Leu Val He Phe Leu Val Ser Ser Met He 
130 135 140 

ctt get ctt ctt att tat cgt aaa caa gaa gat taatagaaag tatctagtga 
Leu Ala Leu Leu He Tyr Arg Lys Gin Glu Asp 
145 150 

tagactaaea gtatgatatg gtatgtcaaa gtatttagga ggagaagat atg tct act 

155 

tta aca ata att att gca aca tta act get ttg gaa cat ttt tat att 
Leu Thr He He He Ala Thr Leu Thr Ala Leu Glu His Phe Tyr He 
160 165 I 70 

atg tat ttg gag acg tta gec acc cag tea aat atg act ggg aag att 
Met Tyr Leu Glu Thr Leu Ala Thr Gin Ser Asn Met Thr Gly Lys He 
175 180 185 

ttt agt atg tct aaa gaa gag ttg tea tat tta ccc gtt att aaa ctt 
Phe Ser Met Ser Lys Glu Glu Leu Ser Tyr Leu Pro Val He Lys Leu 
190 195 200 205 

ttt aag aat caa ggt gta tac aac ggc ttg att ggc eta ttc etc ctt 
Phe Lvs Asn Gin Gly Val Tyr Asn Gly Leu lie Gly Leu Phe Leu Leu 

210 215 220 



143 



191 



239 



287 



335 



383 



431 



484 



542 



590 



638 



686 



734 
tat ggg tea tat att tea cag aat caa gaa att gta get «t ttt 
Tyr Gly Leu Tyr He Ser Gin Asn Gin Glu He Sal La TTl T h t £ 

225 23 ° 235 

e s 2: s m n: s 15 is s s e 

240 245 250 - " 

aaa ate tta tta aaa cag ggt ggt tta cct ata tta get ctt tta ac = 
L ys lie Leu Leu Lys Gin Gly Gly Leu Pro He Leu La Leu Leu Tr 
- 5a 260 265 

Se leu lit taataCtaCt tagcegtteg atttagttga aeggetttta 

270 



782 



830 



878 



927 



n«t«ct«c ItltTlTT ataCa ^ ta 9 tttaagtaat ttgtetttaa aaatagtata 
atataactac gaattcaaag agaggtgact ttgatt atg act gag aac tgg tta 

Met Thr Glu Asn Trp Leu 
275 

His Thr Lv<5 I" rT c Ca a " t3t tat Cgt 9 tC 9" 39* caa ggt 

His Thr Lys Asp Gly Ser Asp He Tyr Tyr Arg Val Val Gly Gin Gly 

285 290 

caa ccg att gtt ttt tta cat ggc aat age tta agt agt cgc tat ttt 
Gin Pro lie Val Phe Leu His Gly Asn Ser Leu Ser sir tT S Tyr Phe 

300 305 310 

gat aag caa ata gca tat ttt tct aag tat tac caa art- a t-t- .- 
Asp Lys Gin He Ala Tyr Phe Ser Lys^ Tyr Tyr Tin TTl lit TTl Tel 

315 320 325 

Asp III To n? 9 «" 933 39t Cat 9 ca aa 9 "a aat ace att agt 

Asp ser Arg Gly H ls Gly Lys Ser His Ala Lys Leu Asn Thr He sir 

J3 ° 33 5 340 

Phe 111 "n Tit IT T ? ? aC a " gat 3tC tta 3tt cat tta gag 

Phe Arg Gin He Ala Val Asp Leu Lys Asp He Leu Val His Leu Glu 

345 350 355 

He IT lvs TT rT T 9 9 T " C 39C gat S gt gcc aat "a get 

He Asp Lys Val lie Leu V al Gly His Ser Asp Gly Ala Asn Leu La 

365 370 

tta gtt ttt caa acg atg ttt cca ggt atg gtt aaa ggg ctt ttq ctt 
Leu Val Pne Gin Thr Met Phe Pro Gly Met Val Arg Gly Itu Leu Leu 

380 3 85 3 9 o 

Tsl Ser Itn x" 9 ^ C " Cag ^ ^ 9" «t ctt 

Asn Ser Gly Asn Leu Thr He His Gly Gin Arg Trp Trp Asp He Leu 

395 400 

tta gta agg att gec tat aaa ttc ctt cac tat tta ggg aaa etc ttt 
Leu Val Arg lie Ala Tyr Lys Phe Leu His Tyr Leu G^y I" Pne 

410 415 420 



987 
1041 



1089 



1137 



1185 



1233 



1281 



1329 



1377 



1425 



1473 
ccg tat atg agg caa aaa gct caa gtt att tcg ctt atg ttg gag gat 1521
Pro Tyr Met Arg Gln Lys Ala Gln Val Ile Ser Leu Met Leu Glu Asp
425 430 435

ttg aag att agt cca gct gat tta cag cat gtg tca act cct gta atg 1569
Leu Lys Ile Ser Pro Ala Asp Leu Gln His Val Ser Thr Pro Val Met
440 445 450

gtt ttg gtt gga aat aag gac ata att aag tta aat cat tct aag aaa 1617
Val Leu Val Gly Asn Lys Asp Ile Ile Lys Leu Asn His Ser Lys Lys
455 460 465 470

ctt gct tct tat ttt cca agg ggg gag ttt tat tct tta gtt ggc ttt 1665
Leu Ala Ser Tyr Phe Pro Arg Gly Glu Phe Tyr Ser Leu Val Gly Phe
475 480 485

ggg cat cac att att aag caa gat tcc cat gtt ttt aat att att gca 1713
Gly His His Ile Ile Lys Gln Asp Ser His Val Phe Asn Ile Ile Ala
490 495 500

aaa aag ttt atc aac gat acg ttg aaa gga gaa att gtt gaa aaa gct 1761
Lys Lys Phe Ile Asn Asp Thr Leu Lys Gly Glu Ile Val Glu Lys Ala
505 510 515

aat tga aaaagtcaaa tcactgactt ctgtgattaa aattgtattt tttatatctg 1817
Asn *

ttttagtgct tattattgtt gaa atg att cat ttg aaa cga act att tct gtt 1870
Met Ile His Leu Lys Arg Thr Ile Ser Val
520 525

gag caa cta aag agt gtt ttt ggg caa tta tct cca atg aat ctt ttc 1918
Glu Gln Leu Lys Ser Val Phe Gly Gln Leu Ser Pro Met Asn Leu Phe
530 535 540 545

tta att atc ctt gtg ggg gtt atc gct gtc tta ccg aca acc gga tat 1966
Leu Ile Ile Leu Val Gly Val Ile Ala Val Leu Pro Thr Thr Gly Tyr
550 555 560

gac ttt gta ctg aat gga ctt tta cgt aca gat aaa agc aaa agg tat 2014
Asp Phe Val Leu Asn Gly Leu Leu Arg Thr Asp Lys Ser Lys Arg Tyr
565 570 575

att tta cag act agt tgg tgt atc aac act ttt aat aac ttg tca gga 2062
Ile Leu Gln Thr Ser Trp Cys Ile Asn Thr Phe Asn Asn Leu Ser Gly
580 585 590

ttc ggt ggc tta atc gat att ggg ttg cgc atg gct ttt tat ggt aaa 2110
Phe Gly Gly Leu Ile Asp Ile Gly Leu Arg Met Ala Phe Tyr Gly Lys
595 600 605

aaa ggt caa gag aag agt gac cta aga gaa gtg act cgt ttt tta ccc 2158
Lys Gly Gln Glu Lys Ser Asp Leu Arg Glu Val Thr Arg Phe Leu Pro
610 615 620 625
Tyl Leu" Til til It IT 5" " = 9t9 " C 9CC C " *" "3 

iyr Leu lie Ser Gly Leu Ser Phe lie Ser Val lie Ala Leu He Met 



640 



745 



750 



IS fit Til IK p" f" 3Ct 9t9 9 " 9 " tgg tta tta ctt 

Ala Ala Glu Gly Leu Pro Lys Glu Thr Val Val Ala Trp Leu Leu Leu 



? 60 765 



tat cgt tta gcc tac tat att at 



Tyr Arg Leu Ala Tyr Tyr He lie Pro Phe Phe f la 9 ? y £ £ » = 

/75 780 



785 



s £ s: £ s: sr y s £ s: e ;s s £ i" x; 

790 7 95 800 

ccg aaa gag tta gta tea act gtt eta caa acc atg gtg age cat tta 
Pro Lys Glu Leu Val Ser Thr Val Leu Gin Thr Mel Vl Ser Sis Leu 



815 



atg cgt att tta ggt gca ttc tta ata ttt tea aca oca t-n- t-f-t- ~ 

Met Arg lie Leu Gly Ala Phe Leu He Phe sTr ?Tr III ^ J£ £J 

825 830 

aat att act tat att atg tgg ttg cag aag eta ggc tta qa c cca tt* 

Asn He Thr Tyr He Met Trn I ru n „ T , 9 9 tta 

835 7 SC lrp Leu Gln L Y S Leu Gly Leu Asp Pro Leu 

84 0 a a r 



2206 



2302 



2350 



Ser His iij L 9 ? C 332 9CC 3 9 C 9" 9 ac "c tat tat tta gta ,,54 

Ser H1S lie Phe H 1S Ala Lys Ala Ser Val Asp Tyr Tyr Tvr Leu Val 

645 "0 sis 

tta att ggt get agt atg tat ttt cct gtt att tat tac att trr nr,r 

116 Tel Ma T> ' r ^ Pr ° ?al Ile ^ S Ser I?J 

665 67 Q 

cat aaa gga age cat tat ttc gga gat atg cca tct agt act cat ata 
H iS Lys Gly Ser His Tyr Phe Gly Asp Met Pro Ser Ser JSr ll g Ile 

680 685 

E E SJ L« S E S" s- £ K s s s 

695 7 °° 705 

" a a " a f c 99 c tat tta atg ggc att cat eta cca gtt tat aaa 
Phe He He He Gly Tvr Leu Mer ru, T i= u ■ „ 

' lyi ijeu Mec GJ -y He His Leu Pro Val Tyr Lys 

710 7 15 720 

att tta cca eta ttt tgt att ggt tgt gcc gtc ggg att gta tec ctt 
He Leu Pro Leu Phe Cys He Gly Cys Ala Val Tly He 111 Ser Leu 

725 730 735 

m it ir y ti: m m ?r T tta 9 " «« «=" «• g99 t « 

740 SU Val Leu Phe Thr G1 V Phe 



2398 



2446 



2494 



2542 



2590 



2638 



2686 



2734 



2782 



2830 
_ a ^ ara C ca tqq cag ttt cca ggt tta ttg etc ggg gtt tgt 

Tin gIu g£ Me? Leu Trp Gin Phe Pro Gly Leu Leu Leu Gly Val Cys 
850 855 860 

ttt act etc tta get aga act ate gat caa aaa gtg aaa aat get ttt 

Se Leu Leu Ala Arg Thr lie Asp Gin Lys Val Lys Asn Axa Phe 

870 875 880 

cca att get att ate tgg att act ttg aca ttg ttt tat ctt aat tta 
Pro 111 Ala lie He Trp He Thr Leu Thr Leu Phe Tyr Leu Asn Leu 

885 890 895 

aat cat att agt tgg cga eta tct ttc tgg ttt ate tta eta ttg tta 
fly His He Ser Trp Arg Leu Ser Phe Trp Phe He Leu Leu Leu Leu 
900 905 910 

gge tta tta gtc att aag cca act etc tat aaa aaa caa ttt att tat 
Gly Leu Leu Val He Lys Pro Thr Leu Tyr Lys Lys Gin Phe He Tyr 
915 920 925 

aac tag gaa gag cgt att aag gat gga ate att ate gtt agt tta atg 
HI Trp Glu 111 Arg lie Lys Asp Gly He He He Val Ser Leu Met 
930 935 940 945 

qga gtt eta ttt tat att gea gga eta eta ttc cct ate agg get cat 
fly val Leu Phe Tyr He Ala Gly Leu Leu Phe Pro lie Arg Ala His 

950 955 960 

att aea ggt ggt agt att gaa egc ctg cat tat ate ata gea tgg gag 
He Thr Gly Glv Ser He Glu Arg Leu His Tyr He He Ala Trp Glu 

965 970 975 

ccg ata gea ttg get acg ttg att ctt act etc gtt tat tta tgt ttg 
Pro S Ila Leu Ala Thr Leu He Leu Thr Leu Val Tyr Leu Cys Leu 
" 980 985 990 

gtt aaa att tta caa gga aaa tct tgt eag att ggt gat gtg ttc aat 
val Lv^ He Leu Gin Gly Lys Ser Cys Gin He Gly Asp Val Phe Asn 
995 1000 1005 

ata aat cgt tat aaa aaa eta ctt caa get tac ggt ggt tct teg gat 
III Lc Arg Tyr Lys Lys Leu Leu Gin Ala Tyr Gly Gly Ser Ser Asp 
1010 * 1015 1020 1025 

age ggt tta gee ttt tta aat gat aaa agg etc tac tgg tac caa aaa 
sir fy Leu Ala Phe Leu Asn Asp Lys Arg Leu Tyr Trp Tyr Gin Lys 

1030 1035 1040 

aat qqa qaa gat tac gtt gcg ttc caa ttt gta att gtc aat aat aaa 
Asn flv Glu Asp Cvs Val Ala Phe Gin Phe Val He Val Asn Asn Lys 

1045 ' 1050 1055 

tgt ctt att atg ggg gaa cca gee ggt gat gac act tat att cgt gaa 
Cys Leu lie Met Gly Glu Pro Ala Gly Asp Asp Thr Tyr He Arg Glu 
1060 1065 10-70 



2878 



2926 



2974 



3022 



3070 



3118 



3166 



3 214 



3262 



3310 



3358 



3406 



3454 



3502 
Til til It: III III ill III Hi ;r gat eta gac tat gac ctt 

1075 ioho P 3 ASP LyS LeU AS ^ Tyr As P 

1080 1085 



3550 



gtt ttt 

Val Phe 
1090 

ggt ttt 

Gly Phe 



acg ttt 
Thr Phe 



aat aga 
Asn Arg 



£ in ttt i?; 2; r ? ta cac 939 tat 

ma* LeU Leu Leu His Glu Tvr 

1100 

■ L±UU 1105 

gac ttt atg aaa gtt qat oao o^r 

Asp Phe j« Lys J., & »S J" S »■ £ tt. jj. 

1115 1120 
act ctt aaa ggg aat aag tac aaa cct ttc aaa a ^ «- 

1111 ^ ^ ryr Lys Pro ^ ^ Tsn 111 £ 

1130 1135 

Va^ 9 f " C t3t ttC 9 aa 9" 3ta caa teg cca 

Val^lu Lys Asp Gly Phe Tyr Phe Glu Val Val Gin Sef p" 

1145 1150 



3598 



His" Glr liu r r Cta aat a9t " 3 9aa att tct 

His Se Glr. Glu Leu Leu Asn Ser Leu Glu Glu lie Ser 

1160 1165 

tta gaa gga cgt cct gaa aaa qqt ttc ^^ 

JJ. ... «„ SI9 Pro „„ Lys » y HI HI «» g» £ 

1175 1180 

gat tat ttc caa caa qcc cca *t-a ^ 

A-P Tyr Phe Gin Gin La p" 111 £j £l £ A^n 

H90 1195 7 

Olu ?S ^a Phe' I£ £ " 9 C " ** C tat 

V™= Ile Met Pro Asn T y^ giu 

1210 



aat act tgg 
Asn Thr Trp 



ate tct 
Ile Ser 



att gat tta atg cat cac aat- ^ 

-1- jy 3 yu ( ~ £iC g a t aaa cag aaa att 

1220 P SU ^ H1S ASP Lys G1 » ^ He 

° 1225 1230 



ttt aat aaa 
Phe Asn Lys 
1185 

get gaa cac 
Ala Glu His 
1200 

aag agt att 
Lys Ser lie 
1215 

ccg aat ggc 
Pro Asn Gly 



gtt atg gat ttc etc ttt tta tea tta ttc tct t- - 

Ss 1=P "» "» S £ Su' £ 

X^4U 1245 

S Tvr HI itl ttt f" " 3 999 at9 9Ca CCt " a 
1,5 0 Y HlS Tyr Phe As P Leu Met Ala Pro Leu Ser 

1255 1260 



cgc get gaa aca aqt ttt art- = = = 

Arg val Glu Thr Ser Phe lla Lvs V* m' 9 9 ° 9 t3t 

L,y S Glu Arg Met Ala Tvr 
1270 12?5 Y 

cat ttc ggc agt cat ttc tac tea ttt aat «rrt- 

His Gly „ Phe ^ ser £ *« HI «; =« 

1285 1290 



gga gtt ggt 
Gly Val Gly 
1265 

ctt gtc tat 
Leu Val Tyr 
1280 

aag tat aag 
Lys Tyr Lys 
1295 



3646 



3694 



3742 



3790 



3838 



3886 



3934 



3982 



4030 



4078 



4126 



4174 
aa 9 aag ttt aea cca ttg tgg teg gaa cgt tat att tet tgt tec egt 
Lys Lys Phe Thr Pro Leu Trp Ser Glu Arg Tyr lie 
1300 1305 



4222 



2 S S S S £ S S K K £ =3 K K E 

_ 119(1 = 



4270 



1315 1320 



aaa att aag att gtt aaa taagctttat ttggcaatta aaaagagcat 



Lys He Lys He Val Lys 
1330 1335 



<210> 2 
<211> 154 
<212> PRT 

<213> Streptococcus 



Ser Clylvl Glu Pro Ala Asn Arg Phe Ser Trp Ala Lys Asn Lys Leu 
Leu Ile Asn Gly Phe Ile Ala Thr Leu Ala Ala Thr Ile Leu Phe Phe
Ala Val Gln Phe Ile Gly Leu Lys Pro Asp Tyr Pro Gly Lys Thr Tyr
Phe Ile Ile Leu Leu Thr Ala Trp Thr Leu Met Ala Leu Val Thr Ala
Leu Val Gly Trp Asp Asn Arg Tyr Gly Ser Phe Leu Ser Leu Leu Ile
Leu Leu Phe Gln Leu Gly Ser Ser Ala Gly Thr Tyr Pro Ile Glu Leu
Ser Pro Lys Phe Phe Gln Thr Ile Gln Pro Phe Leu Pro Met Thr Tyr
Ser Val Ser ITy Leu Arg Glu Thr He Ser Leu Thr Gly Asp Val Asn 
Hi. Gin Trp Arg Met Leu Val fie Phe Leu Val Ser Ser Met He Leu 



135 140 
Ala Leu Leu Ile Tyr Arg Lys Gln Glu Asp
145 150 

<210> 3 
<211> 118
<212> PRT 

<213> Streptococcus



4318 



gtc « 9 c 9 *= .«■«=«« iiiitlllll sssss 

isi suses i"iu~> :m 

agcttttttt acttga 



Met Ser Thr Leu Thr Ile Ile Ile Ala Thr Leu Thr Ala Leu Glu His

Tyr He Met Tyr Leu Glu Thr Leu aL Thr Gin Ser Asn Met Thr 

20 25 
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«y u,. u. Phe „ r Msc .„ „„ „ u leu set l ^ ^ ^ 

.1. Ly, Uu Ph . lys v ^ t ^ a ^ „ u .^ ^ ^ 

Phe Leu Leu Tyr Gly Leu Tyr I le Ser G , a *° 

65 7Q > " e s er Gln Asn Gin Glu lie Val Ala 

Val Phe Leu lie Asn v=i t„ 75 ro 

Asn Val Leu Leu Val Ala lle Tyr Gly ^ ^ ^ 

Val Asp Lvs Lys He t~,, t« 95 
. U P lie Leu Leu Lys Gin niv rixr r ~ 

100 Y ^ G± y G1 y L ^u Pro He Leu Ala 

Leu Leu Thr Phe Leu Phe 110 



115 

<210> 4 
<211> 247 
<212> PRT 

<213> Streptococcus 
<400> 4 



Met Thr Glu Asn Trp Leu Hi* Th. T 
± Leu his Thr Lys Asp Gly Ser Asp Ile Tyr ^ 

v.. v al oj, »„ 01 „ al „ Pro » ph> l ^ ^ y , L ^ 
„, .„ « g Tyr Phe A , p „ ne ai< ^ ^ 

Tyr Gin Val H e V al Met A.n * 45 

50 *J P SSr Ar 9 G ^ His Gly Lys Ser His Ala 

Lys Leu Asn Thr lie Ser Phe Ara ri n T i 6 ° 

65 70 9 116 Ala Val Leu Lys Asp 

He Leu Val His Leu Glu lie Asd t v « v ■, 3? 80 

8s Asp L ys Val H e Leu Val Gly His Ser 

Asp Gly Ala Asn Leu Ala Leu Val Ph„ u 95 

100 91 fj* Gln Thr Met Phe Pro Gly Met 

Val Arg Gly Leu Leu Leu Asn Ser rif, a 110 

115 n I" G ^ Asn L eu Thr lie His Gly Gln 

Arg Trp Trp Asp ll e Leu Leu Val 125 

130 13s ^ Ar 9 Ile Ala Tyr Lys Phe Leu His 

Tyr Leu Glv Lvs Lph Dh~ o m 140 
-mr " Phe Pro Tyr Met Ara n„ , 

145 150 y " ec A ^g Gln Lys Ala Gln Val lie 

Ser Leu Met Leu Glu Asp Leu Lvs n- e„ l** 160 

P Lys lie ser Pro Ala Asp Leu Gln His 

Val Ser Thr Pro Val Met Val Leu Val ri, a , 175 

180 ^ Asn Lys Asp He lie Lys 

Leu Asn His Ser Lys Lys Leu Ala Ser Tvr Php D 190 

195 ber T ^ r p he Pro Arg Gly Glu Phe 

Tyr Ser Leu Val Gly Phe Gly His His t ]p t1 t 205 

210 21 Y S Hls His X1 e He Lys Gln Asp Ser His 

Val Phe Asn Il e Ile Ala L 22 0 

225 23Q y L ^ S Phe He Asn Asp Th r Leu Lys Gly 

Glu Ile Val Glu Lys Ala Asn 235 240 



245 

<210> 5 
<211> 816 
<212> PRT 

<213> Streptococcus 
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Met Ile'His Leu Lys Arg Thr lie Ser Val Glu Gin Leu Lys Ser V.l 

Phe Gly Gln Leu Ser Pro Met Asn Leu Phe Leu Ile Ile Leu Val Gly

20 25 30

Val Ile Ala Val Leu Pro Thr Thr Gly Tyr Asp Phe Val Leu Asn Gly

35 40 45

Leu Leu Arg Thr Asp Lys Ser Lys Arg Tyr Ile Leu Gln Thr Ser Trp

55

Cys Ile Asn Thr Phe Asn Asn Leu Ser Gly Phe Gly Gly Leu Ile Asp

70 75 80

Ile Gly Leu Arg Met Ala Phe Tyr Gly Lys Lys Gly Gln Glu Lys Ser

85 90 95

Asp Leu Arg Glu Val Thr Arg Phe Leu Pro Tyr Leu Ile Ser Gly Leu

Ser Phe Ile Ser Val Ile Ala Leu Ile Met Ser His Ile Phe His Ala

120 125

L vs Ala III Val Asp Tyr Tyr Tyr Leu Val Leu lie Gly Ala Ser Met 

13 5 

Tyr III Pro Val He Tyr Trp lie Ser Gly His Lys Gly Ser His Tyr 

150 155 
P^e Gly Asp Met Pro Ser Ser Thr Arg He Lys Leu Gly Val Val Ser 

16 5 17 0 

Phe Phe Glu Trp Gly Cys Ala Ala Ala Ala Phe He He He Gly Tyr 

18 0 185 
L eu Met Glv He His Leu Pro Val Tyr Lys He Leu Pro Leu Phe Cys 

2 0 0 z\j~> 
He Gly III Ala Val Gly He Val Ser Leu He Pro Gly Gly Leu Gly 

215 22 0 

Ser Phe Glu Leu Val Leu Phe Thr Gly Phe Ala Ala Glu Gly Leu Pro 

2 3 0 2 3 5 

HI Glu Thr val Val Ala Trp Leu Leu Leu Tyr Arg Leu Ala Tyr Tyr 

245 250 
lie He Pro Phe Phe Ala Gly He Tyr Phe Phe He His Tyr Leu Gly 

ICC £. I \) 

2 6 0 

Ser Gin He Asn Gin Arg Tyr Glu Asn Val Pro Lys Glu Leu Val Ser 

c 2 8 0 Zod 

Thr Val Leu Gin Thr Met Val Ser His Leu Met Arg He Leu Gly Ala 

2 95 3 0 0 

Phe Leu He Phe Ser Thr Ala Phe Phe Glu Asn lie Thr Tyr He Met 

310 315 
Trp Leu Gin Lys Leu Gly Leu Asp Pro Leu Gin Glu Gin Met Leu Trp 

3 2 5 3 3 0 

t n v rvs Phe lie Leu Leu Ala Arg 

Gin Phe Pro Gly Leu Leu Leu Gly Val cys pne xxc ^ 

34 0 3 4 5 

Thr He Asp Gin Lys Val Lys Asn Ala Phe Pro He Ala He He Trp 

355 360 

He Thr Leu Thr Leu Phe Tyr Leu Asn Leu Gly His He Ser Trp Arg 

r*70 375 380 

L eu Ser Phe Trp Phe He Leu Leu Leu Leu Gly Leu Leu Val He Lys 

3 9 0 3 9 5 

III Thr Leu Tyr Lys Lys Gin Phe He Tyr Ser Trp Glu Glu Arg He 

4 0 5 410 

Lys Asp Glv He lie lie Val Ser Leu Met Gly Val Leu Phe Tyr He 
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Ala Gly 

Glu Arg 
450 
Leu lie 
465 

Lys Ser 
Leu Leu 
Asn Asd 



Ala Phe 
530 
Pro Ala 
545 

Asp Asp 

Gin Lys 

Val Gly 

Asn Lys 
610 
Gly Phe 
625 

Asn Ser 



Leu Leu 
435 

Leu His 

Leu Thr 

Cys Gin 

Gin Ala 
500 
Lys Arg 
515 

Gin Phe 



Phe Pro 

Tyr He 

Leu Val 
470 
He Gly 
485 

Tyr Gly 
Leu Tyr 
Val He 



Gly Asp 

Ala Asp 

Leu Thr 
580 
Glu Asp 
595 

Tyr Lys 
Tyr Phe 
Leu Glu 



Lys Gly 

Pro He 

Asn He 
690 
Arg His 
705 

Leu Ser 
Leu Gly 
Ala Lys 



Tyr Ser 
770 
Trp Ser 
785 

Ala lie 



Phe Ser 
660 
Ala Leu 
675 

Met Pro 

Asp Lys 

Leu Phe 

Met Ala 
740 
Glu Arg 
755 

Phe Asn 



Asp Thr 
550 
Lys Leu 
565 

Leu Leu 

Ala Leu 

Pro Phe 

Glu Val 
630 
Glu He 
645 

Leu Gly 



He Arg Ala 

440 
He Ala Trp 
455 

Tyr Leu Cys 

Asp Val Phe 

Gly Ser Ser 
505 

Trp Tyr Gin 
520 

Val Asn Asn 
535 

Tyr He Arg 



His He 
Glu Pro 



Leu Val 
475 
Asn Val 
490 

Asp Ser 
Lys Asn 
Lys Cys 



Asp Tyr Asp 

Leu His Glu 
585 

Val Asn Leu 
600 

Arg Asn Ala 
615 

Val Gin Ser 
Ser Asn Thr 



Val Lys 

Asn Tyr 

Gin Lys 
710 
Ser Tyr 
725 

Pro Leu 
Met Ala 
Gly Leu 



Glu Arg 
Cys Ala 



Tyr He 
790 
Leu Leu 
805 



Tyr Phe Asn 
665 

Asn Ala Glu 
680 

Glu Lys Ser 
695 

He Pro Asn 

Tyr Gin Glu 

Ser Gly Val 
745 

Tyr Leu Val 
760 

His Lys Tyr 
775 

Ser Cys Ser 



Met Glu Asp 



Glu Ala 
555 
Leu Val 
570 

Tyr Gly 

Glu Thr 

Leu Asn 

Pro His 
635 
Trp Leu 
650 

Lys Asp 

His Glu 

He He 

Gly Val 
715 
Lys Gly 
730 

Gly Arg 

Tyr His 

Lys Lys 

Arg Ser 
795 
Ser Lys 
810 



Thr Gly Gly Ser He 
445 

He Ala Leu Ala Thr 
460 

Lys He Leu Gin Gly 

480 

Asp Arg Tyr Lys Lys 

495 

Gly Leu Ala Phe Leu 
510 

Gly Glu Asp Cvs Val 
525 

Leu lie Met Gly Glu 
540 

He Glu Ser Phe lie 

560 

Phe Tyr Ser lie Gly 

575 

Phe Asp Phe Met Lys 
590 

Phe Thr Leu Lvs Gly 
605 

Arg Val Glu Lys Asp 
620 

Ser Gin Glu Leu Leu 

640 

Glu Gly Arg Pro Glu 

655 

Tyr Phe Gin Gin Ala 
670 

Val Val Ala Phe Ala 
685 

Ser lie Asp Leu Met 
700 

Met Asp Phe Leu Phe 

720 

Tyr His Tyr Phe Asp 

735 

Val Glu Thr Ser Phe 
750 

Phe Gly Ser His Phe 
765 

Lys Phe Thr Pro Leu 
780 

Ser Trp Leu He Cys 

800 

He Lys lie Val Lys 

815 



<210> 6 
<211> 518 
<212> PRT 

<213> Streptococcus 



<400> 6 
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Met Arg He Leu Gly Ala Phe Leu He Phe Ser Thr Ala Phe Phe Glu 

Asn Ile Thr Tyr Ile Met Trp Leu Gln Lys Leu Gly Leu Asp Pro Leu

Gln Glu Gln Met Leu Trp Gln Phe Pro Gly Leu Leu Leu Gly Val Cys

Phe Ile Leu Leu Ala Arg Thr Ile Asp Gln Lys Val Lys Asn Ala Phe

Pro Ile Ala Ile Ile Trp Ile Thr Leu Thr Leu Phe Tyr Leu Asn Leu

Gly His Ile Ser Trp Arg Leu Ser Phe Trp Phe Ile Leu Leu Leu Leu

85 90
Gly Leu Leu Val Ile Lys Pro Thr Leu Tyr Lys Lys Gln Phe Ile Tyr

Ser Trp Glu III Arg He Lys Asp Sy He He lie Val Ser Leu Met 

120 LZz> 

Gly val Leu Phe Tyr He Ala Gly Leu Leu Phe Pro He Arg Ala His 

lie Thr Gly Gly Ser He III Arg Leu His Tyr Tie He Ala Trp Glu 

III He Ala Leu Ala rll Leu He Leu Thr ill Val Tyr Leu Cys Leu 

val Lvs He Leu III Gly Lys Ser Cys Gil He Gly Asp Val Phe Asn 

185 -L"v 
val Asp Arg Tyr Lys Lys Leu Leu Gin Ala Tyr Gly Gly Ser Ser Asp 

t 9 5 200 ^vjd 

Ser Gly Leu Ala Phe Leu Asn Asp Lys Arg Leu Tyr Trp Tyr Gin Lys 

Asn lly Glu Asp Cys Val Ala Phe Gin Phe Val Til Val Asn Asn Lys 

HI Leu He Met Gly III Pro Ala Gly Asp Asp Thr Tyr He Arg Glu 

Ala He Glu Ser lil He Asp Asp Ala Asp Lys Leu Asp Tyr Asp Leu 

Val Phe Tyr Ser He Gly Gin Lys Leu Thr Leu Leu Leu His Glu Tyr 

Gly Phe Asp Phe Met Lys Val Gly Glu Asp Ala Leu Val Asn Leu Glu 

Thr III Thr Leu Lys Gly Asn Lys Tyr Lys Pro Phe Arg Asn Ala Leu 

III Arg Val Glu Lys Asp Gly Phe Tyr Phe III Val Val Gin Ser Pro 

His Ser Gin Glu III Leu Asn Ser Leu Glu Glu He Ser Asn Thr Trp 

40 345 j->u 

L eu Glu Gly Arg Pro Glu Lys Gly Phe Ser Leu Gly Tyr Phe Asn Lys 

-5 360 36b 

Asp Tyr Phe Gin Gin Ala Pro He Ala Leu Val Lys Asn Ala Glu His 

370 375 380 

Glu Val Val Ala Phe Ala Asn He Met Pro Asn Tyr Glu Lys Ser He 

III Ser lie Asp Leu III Arg His Asp Lys Gin Lys He Pro Asn Gly 

Val Met Asd Phe III Phe Leu Ser Leu Phe Ser Tyr Tyr Gin Glu Lys 

* 420 425 
Glv TV- His Tyr Phe Asp Leu Gly Met Ala Pro Leu Ser Gly Val Gly 

440 445 
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Arg Val Glu Thr Ser Phe Ala t ve 

450 JJ* LyS Glu Ar S Met Ala Tyr Leu Val Tyr 

His Phe Gly Ser His Phe Tyr Ser Ph<=> ae pi 46 ° 

465 47Q iyr Ser Phe Asn Gly Leu His Lys Tyr Lys 

Lys Lys Phe Thr Pro Leu Trn Ser pin a , 480 

4 85 P 1U Ar9 Tyr Ile Ser Cys Ser Arg 

_ 4 on ^ 

Ser Ser Trp Leu He Cys Ala T1„ rv ^ r 495 

500 7 3 C s Y Q l Ala Leu Leu Met Glu Asp Ser 

Lys He Lys He Val Lys 510 
515 



<210> 7 
<211> 5126 
<212> DNA 

<213> Streptococcus

<220> 

<221> CDS 

<222> (1) . . . (687) 

<221> CDS 

<222> (701) . . . (2557)
<221> CDS 

<222> (2566) . . . (3036) 
<221> CDS

<222> (3106) . . . (4842) 
<221> CDS

<222> (4850) . . . (5125) 
<400> 7 



aat ttt gat ate gaa aca aca art rrr 

Asn Phe Asp ll e Glu Thr Thr Th J J" 9*9 9ca atg aaa aag cac gcg 
x ^ inr Thr Phe Glu Ala Met Lys Lys His Ala 



10 



S« III Til Lys 3 Til lit ?£ ^ tct 9aa ttt gat 

2Q ys ser val Glu Arg Ser Phe lie Glu Phe Asp 



25 30 



aaa ctt eta tta gca cct tat taa 

Lys Leu Leu Leu Ala Pro Tvr t f " " a at9 Ctg gca cta ^ta 

^ Aia Pro Tyr Trp Arg Lys Gly Met Leu Ala Leu He 



4 ° a rr 

45 



gat agt cat get ttt aat tat cta cca tar rr, a 

Asp ser His Ala Phe Asn Tyr Leu p" clt r *" 3 " 933 tta 

50 ^ U Pro Leu Lys Asn Arg Glu Leu 

" 60 



caa tta age gee ttt ttg tec caa rr* ~ = - 

Gin Leu Ser Ala Phe Leu Ser Gin 2eu Isn ?** ?" "* ^ 

65 7n Asp Lys As P phe Leu Phe Glu 

7S 

^ 80 
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Leu Asp Lys Val Ala Asn Lys He He !i! Dh „, 495 

500 5J5 Ala Phe Glu Asp Asn Asp He 

Arg Glu Phe Phe Gly Asn Tyr Thr Asn T^r- t » 510 

515 1 P " LyS Ala Phe 

Asn Glu Gin Asn Asn Glu Val Ile Ser L yS Lys Glu S Thr Lys Thr 

Ser Arg Glu Lys Gin Ser Arg Lys Arg Met Ser Tyr Phe Glu Lys Gin 

Glu Trp Ala Thr lie Glu Asn a<=r-> n „ 555 560 

__xe giu Asp Asp He Met lie Leu Glu Asn Thr il e 

Thr Arg He Glu Asn Asp Met Gin Thr- ^ 575 

580 5 £5 Cys Ser Asp Phe Thr Arg 

Leu Ser Asp Leu Gin t v= ni,, T » 590 

5 95 ^ G1U ^ AS ^ A1 * Lys Asn Glu Ala Leu Leu 

Glu Lys Tyr Asp Arg Tyr Glu Tyr Leu Ser Pin r f° 5 

610 y ber Glu Leu Asp Thr 

615 620 

<210> 10 
<211> 157 
<212> PRT 

<213> Streptococcus 
<400> 10 

Met lie He Arg Pro ll P n ~ T „ - 
x . Pro lie lie Lys Asn Asp Asp Gin Ala Val Ala Gin 

Leu He Arg Gin Ser Leu Arg Ala Tvr IL r * 15 

2 0 y Tyr As P Leu Asp Lys Pro Asp Thr 

Ala Tvr Ser Asd Pro w-i e r ~, * 25 30 

- ^ sp Pro HlB Leu Asp HlS Leu Thr Ser Tyr Tyr Glu Lys 

Ha Glu Lys Ser Gly Phe ph e val Ile olu Glu Arg « p ^ ^ ^ 
Oly Cys Gly Gly Phe Gly Pro Leu Lys Asn Leu iTe Ala Glu Met Qln 

Lys Val Tyr He Ala Glu Arq Phe Am m , 80 

g5 u Arg Phe Arg Gly Lys Gly Leu Ala Thr Asp 

Leu Val Lvs Met Il e Glu Val Glu Ala lla t ti 95 

100 ?:* Arg Lys Ile G1 Y Tyr Arg Gin 

Leu Tyr Leu Glu Thr Ala Ser Th.- t «, o 110 

115 ^0 ^ 9 Thr Ala Val Tyr 

Lys His Met Gly Tyr Cvs Ala Leu Ser n t, d r, 125 

130 * 135 ej Ser Gln p ^o He Ala Asn Asp Gin 

Gly His Thr Ala Met Asp Il e Trp Met Tl. t 14 ° 
145 P Met Ile L VS Asp Leu 

U 155 



<210> 11
<211> 579 
<212> PRT 

<213> Streptococcus 
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, <4 ^ 0> tL rrv Ser Tyr Leu Lys Arg Tyr Pro Asn Trp Leu Trp 
Met Ala Tyr He Trp ber iyj- 1 10 15 

1 5 r^u~ w^i rnr val He Leu Glv Met 

- t pi,, ziia Met Leu Phe Val Tnr vai j-j-c 
Leu Asd Leu Leu Gly Ala Net u ^ 3Q 

Pro Thr .1, £. »>• o»y »t .1. -P «» ™* WS G1 " ASP 

„ g Thr Siy v.l V -» Trp Sr Ph. U. « JJ- II. Ph. v.l v.! 

L « u S, ii. ii. «y « "« Thr »' " a ^ r "* s " s " " 9 

65 70 » irn Asn Asp Met Tvr Ala Lys Leu 

Thr Thr Thr Met lie Arg Asp Met Arg Asn Asp ^ 

c . Sis His Glu ^yr Glu Gin He Gly Val Ser Ser Leu 
Gin Glu Tyr Ser His His biu -y- 110 

val Thr Arg Met Thr Ser Asp Thr Phe Val Leu Met Gin Phe Ala Glu 
Met Ser III Arg Leu Gly Leu Val Thr Pro Met Val Met He Phe Ser 

130 , T T1P Thr Ser P-o Ser Leu Ala Trp Leu Val Ala 

Val Val Met He Leu He Thr ser e.a => ^ 16Q 

I 45 I 50 w=t rlv Val Val Leu Tyr Val Ala He Lys 

Val Ala Met Pro Leu Leu Val Gly Val va. ^ 

t e" Glu Arq Gin Gin Thr Met Leu Asp Lys He Asn 
Thr Lys Pro Leu Ser Glu Arg hj." 190 

180 , T ^ rlv Leu Arq Val Val Arg Ala Phe 

Gin Tyr Val Arg Glu Asn Leu Th. Gly Leu Arg ^ 

X f * PhP Gin Ser Gin Lys Phe Gin Val Ala Asn Gin Arg 
Ala Arg Glu Asn Phe Gin ber u-i y ^ 

210 u e Thr 111 Leu Phe Lys Leu Thr Gly Leu Thr Glu 

Tyr Thr Asp Thr Ser Thr Gly Leu fne y ^ 24Q 

HI Leu Phe val Gin He He He Ala Met He Val Ala He Val Trp 
Phe Ala Leu Asp Pro Leu Gin Arg Gly Ala He Lys He Gly Asp Leu 

, =v nlu T V r Ser Phe His Ala Leu Phe Ser Phe Leu Leu 

Val Ala Phe He Glu Tyr ber r ^ 

275 ^ Tvr Pro Arq Met Val Val Ser Ser His 

Phe Ala Asn Leu Phe Thr Met Tyr Pro Arg m ^ 

290 2 „ q _, Ti e Asn Pro Asn Ala 

Arg He Arg Glu Val Met Asp Met Pro He Ser He Asn ^ 

305 3 l° t „ t»„ tvs Glv His Leu Glu Phe Asp Asn 

Glu Gly Val Thr Asp Thr Lys Leu Lys Gl> His ^ 

val Thr Phe Ala Tyr Pro Gly Glu Thr Glu Ser Pro Val Leu His Asp 
Ile ser Phe Lys Ala Lys Pro Gly Glu Thr He Ala Phe He Gly Ser 
Thr Gly fer Gly Lys Ser Ser Leu Vai Asn Leu lie Pro Arg Phe Tyr 

370 375 w _ ^ r1v Val Asp val Arg Asp 

Asp val Thr Leu Gly Lys lie Leu Val Asp Gly val Asp a ^ 

385 390 r, n„ tvs lie Glv Phe He Pro Gin Lys 

Tyr Asn Leu Lys Ser Leu Arg Gin Lys He eiy ^ 

, *°l rlv Thr ne Glv Glu Asn Leu Lys Tyr Gly Lys 
Ala Leu Leu Phe Thr Gly Tnr 11- 430 

m r\ 4 2 J 
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Ala Asp 

Ala Lvs 
450 
Ala Glu 
465 

He Ala 
Ser Phe 
Leu Lvs 



Val Gly 
530 
Glu He 
545 

He Tyr 



Ala Thr 
435 

Glu Phe 

Gly Gly 

Arg Ala 

Ser Ala 
500 
Glu Val 
515 

Thr He 



lie Asp Asp 

He Glu Ser 
455 

Ser Asn Leu 

470 
Val Val Lys 
485 

Leu Asp Tyr 



Leu Arg 
440 

His Gin 
Ser Gly 
Asp Pro 



Lys Thr 
505 

Thr Gly Asp Ser Thr 

520 

Asp Gin 



Val Gly 
Arg Glu 
Glu Glv Glu 



Met Asp Ala 
535 

Arg Gly Thr 

550 
He Ala Glu 
565 



His Ala 
Ser Gin 



Gin Ala Val 

Glu Ala Phe 
460 

Gly Gin Lys 
475 

Asp Leu Tyr 
490 

Asp Ala Thr 

Val Leu He 

He He Val 
540 

Gin Leu He 
555 

Leu Lys Asn 
570 



Asp He Ser Gin 
445 

Glu Thr His Leu 

Gin Arg Leu Ser 

480 

He Phe Asp Asp 
495 

Leu Arg Ala Arg 
510 

Val Ala Gin Arg 
525 

Leu Asp Glu Gly 

Glu Asn Asn Ala 

560 

Gin Asn Leu Ser 
575 



<210> 12 
<211> 92 
<212> PRT

<213> Streptococcus 
<400> 12 

Met Arg Lys Lys Ser Val Phe t ^ a 
± s Phe Leu Arg Leu Trp Ser Tyr Leu Thr Arg 

Tyr Lys Ala Thr Leu Phe Leu Ala tu 15 

20 Ala " e Phe ^ Lys Val Leu Ser Ser 

»» v,: ^ ala Pro - am ^ ay ^ ^ „ ^ ^ 

-» lT „. „„ Leu vel Met sl> Lys nr w „ ^ ^ ^ 

Leu Asn Val Pro Tyr lie ai= m t-. 60 

65 ^ " e Ala ^ Ile Leu lie He Tyr Phe Phe Arg 

Gly Val Phe Tyr Glu Leu Qly ser ^ q 7^ 

90 
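
As a consistency check on the feature table given for SEQ ID NO: 7, the length of each coding region can be compared with the declared length of the protein it encodes. The Python sketch below is illustrative only and is not part of the original listing. The helper name `cds_codons` is hypothetical, and the pairing of the last three CDS features of SEQ ID NO: 7 with SEQ ID NOS: 10, 11 and 12 is an assumption made for this example.

```python
def cds_codons(start: int, end: int) -> int:
    """Return the number of codons in a CDS given 1-based inclusive
    coordinates. Works for either strand because only the span is used."""
    length = abs(end - start) + 1
    if length % 3 != 0:
        raise ValueError(f"CDS {start}..{end} is not a whole number of codons")
    return length // 3


# CDS coordinates from the <222> lines of SEQ ID NO: 7, paired with the
# <211> lengths declared for SEQ ID NOS: 10, 11 and 12.
# The pairing itself is an assumption of this sketch.
cds_to_declared = {
    (2566, 3036): 157,  # assumed to encode SEQ ID NO: 10
    (3106, 4842): 579,  # assumed to encode SEQ ID NO: 11
    (4850, 5125): 92,   # assumed to encode SEQ ID NO: 12
}

for (start, end), declared in cds_to_declared.items():
    codons = cds_codons(start, end)
    status = "matches" if codons == declared else "differs from"
    print(f"CDS {start}..{end}: {codons} codons, {status} declared length {declared}")
```

Under this pairing each range divides evenly into codons and agrees with the declared `<211>` value, which is what would be expected if the stop codon lies outside the annotated range.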

<210> 13 
<211> 5215 
<212> DNA 

<213> Streptococcus 

<220> 

<221> CDS 

<222> (3) . . . (122) 

<221> CDS 

<222> (133) . . . (2511) 
<221> CDS 

<222> (367) . . . (2511) 
<221> CDS 
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<222> (2946) . . - (2716) 

<223> of complementary strand 

<221> CDS 

<222> (3252) . . . (2995) 

<223> of complementary strand 

<221> CDS 

<222> (3676) . . . (3299)

<223> of complementary strand 



<221> CDS 

<222> (4124) . . . (3837)

<223> of complementary strand 



<221> CDS

<222> (5214) . . . (4351)

<223> of complementary strand 



<400> 13 „_ at -t att aqt 47 



\ ^ rh- rrs aca Qtt qaa gta aag gag att duu dyu. 
" HI 1% - E Si IT, Si ?.l Slu 5al .ys Olu He He Ser 



1 5 



L ag gag att att agt 

I 

10 

sr. sr. m s: s s s s s »" s s s 

20 25 

i-i-a rra act tqq taagcatcat atg ggt eta gca 
tac tea tat tgg aag tta cca act tgg g q ^ ^ 

Tyr Ser Tyr Trp Lys Leu Pro Thr Trp 

35 40 

« = +-t- orr tat att qat gac age aaa ggt aag gca 

s m m sx ;r. sr. £ xi. -p »•* no 



50 55 
45 DU 



^ ^ aaa acq atq gat caa ate agt get gaa gaa 

t" Til in s "» £ s »s Lp - «. - »• «■ 



65 70 



s :r. e s s; ss s: £ 9 - *" - - - -J * - 



80 85 



; 3 e s« sr. ss k s: i £ - s "» is "i & £ 
s sr. - - - sr. s; m s s? s s £ £ £ 



115 120 
110 11 3 



= rt *r att ate aat gaa ate tta gac ggt tac gtt 

i% m sr. in in 8 »i «• «- ~. n ; - "p ^ \:i 

125 



95 



144 



192 



240 



288 



336 



384 



432 
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att aaa gtc aat g gc aac tat tar art- 

He Lys Val Asn Gly Asn Tyr Tvr Va! ^vr llu f* 9 99t * 9t 339 480 

14 5 * 7 ^ eu Lys Pro G1 Y Ser Lys 



150 155 



cgc aaa aac att caa acc aaa c 

Ly „„ „. s ; 9 Thr Ly , = « «• J« jet »-» =.. £. e« 

1 f^l 



165 

170 



"Ty Thr Lys gTu IS £ £1 £ Q^y Le u ^ f 9 f « «t etc 

175 * y ueu Ala Gln v al Ala His Leu 

180 185 



agt aaa gaa gaa gtt qca ara 

Ser Lys Glu Glu Val | la 9 ^ ^ 9« gca aaa aga caa gga cgc 

190 1Q _ Vai Asn Glu Al a Lys Arg Gin Gly Arq 

" 200 



2 15 

220 



230 235 



att cct aaa aag gat tta tct 

He Pro Lys Lys Lp Leu sir tro Ser Ilu T I? 9 ° C 9Ca " a ^ 

24 0 2" Glu Leu Ala Ala Ala Gin Ala 



245 250 



tac tgg agt caa aaa caa ggt caa oai- 

Tyr Trp Ser Gin Lys Gin Ify A rg *?J |£ ^ «9 tct gat tac cgc 

255 y Ala Ar 9 p ro Ser Asp Tyr Arg 

2 65 



"i in it: tit sj s s L" ir r* *" ~« 9 « « «■ 

2 70 7 Arg Ala He Pro Asp Val Thr 

* 280 

Pro a" Pro Gly Gin Ify h£ g!h pf " C 99t " C tat Cat CCa 

285 - °90 ASP ASn Gly G ^ ^ His Pro 



295 300 



gcg cct cct agg cca aat gat acq tea cz* aa 

Ala Pro Pro Arg P ro Asn Asp Ala s" Gin a" " a 393 9at 

3 05 Gln ASn ^ His Gin Arg Asp 



310 315 



gag ttt aaa gqa aaa am t-t-- 

Glu Phe Lys Ify "s Thr Jhe Lyf flu f" " a 93t Caa cta «c cgt 

. nr Phe Lys Glu Leu i,eu Asp Gin Leu His Arg 



528 



325 330 



ctt gat ttg aaa tac cat r*r rn-~ 

- A5P T - - =" s: £ ss s m m s 

0 345 



576 



ccg act caa gtg ate aaa tea aac act ri-i- „ 

Pro Thr Gin Val lie Lys Ser A^n li It " 9 tat 9tg g tg cct cat 

350 ' Asn Ala P ^ Gly Tyr Val Val Pro His 

" 360 



624 



672 



gat tta gga gat get tat tta qta cct r*r ^ 
Asp Leu Gly Asp Ala Tyr Leu Val Pro H 1S Gly J" ^ C3t tat ™ 

225 Gly Asn His Tyr His Tyr 



768 



816 



864 



91: 



960 



1008 



1056 



1104 
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= „»t cat cat cat att ate cca aga agt cag tta tea cct ctt gaa 
gga gat cat tat cat a ger prQ Leu Qlu 

Gly Aso His Tyr His lie lie Pro Arg sei. 
365 " 370 375 

= ^ o* a tta qca gat cga cac tta get ggc caa act gag gac aat gac 
Me? ITu Leu Ala Lp Arg Tyr Leu Ala Gly Gin Thr Glu Asp Ajn Asp 

385 390 



in % s sr. sr. ss « ~ s 52 i »" s 

400 405 

* + * ctt act cat cgc ate aaa get tac gga aaa ggc tta gat ggt aaa 

Phe Leu fly "s A?g He Lys Ala Tyr Gly Lys Gly Leu Asp Gly Lys 

*-ic 420 



i-ar aat acq aqc gat get tat gtt ttt agt aaa gaa tec att cat 
Tro Tyr asp Thr Ser Lp La Tyr Val Phe Ser Lys Glu Ser He His 



435 440 
430 4 " ib 



- a aa r aaa tea gga gtt aca get aaa cac gga gat cat ttc cac 

HI ?3 HI Lys Ser Sy Val Thr Ala Lys His Gly Asp His Phe His 

4 5 0 45b 
445 450 

. = aaa ttt qqa gaa ctt gaa caa tat gag ttg gat gag gtc get 

Tyl He Sy Phe Gly Glu Leu Glu Gin Tyr Glu Leu Asp Glu Val Ala 

465 470 

aac tgg gtg aaa gea aaa ggt caa get gat gag ctt get get get ttg 
lln S? Val Lys Ala Lys Gly Gin Ala Asp Glu Leu Ala Ala Ala Leu 

480 485 

i% in sr. 3" ss "s sr. i" i" £ m ill s *" s ?2 

r 500 b0t> 



agt cgc aaa gta aca aaa gat ggt aaa gtg ggc tat atg atg cca aaa 
Ser Arg Lys Val Thr Lys Asp Gly Lys Val Gly Tyr Met wet v y 



510 



515 520 



<- a ,n a *c tat ttc tat get cgt gat caa ctt gat ttg act cag 

Lp % Ly 9 s HI Tyr III Tyr La Arg Asp Gin Leu Asp Leu Thr Gin 
525 530 " 5 

att gee ttt gee gaa caa gaa eta atg ctt aaa gat aag aag cat tac 

Til Ala Phe La Glu Gin Glu Leu Met Leu Lys Asp Lys Lys His Tyr 

545 550 

cot tat gae att gtt gac aca ggt att gag eea ega ctt get gta gat 
Sg Tyr Lp He Val Asp Thr Gly He Glu Pro Arg ,eu Ala Val Asp 

560 565 

52 s" s; m s s s: £ & £ s e s K £ ss 

575 580 585 



1152 



1200 



1248 



1296 



1344 



1392 



1440 



1488 



1536 



1584 



1632 



1680 



1728 



1776 






WO 99/42588 

PCT/CA99/00114 

29 / 63 

agt teg ttt gtc acc cca cat ar- 

Ser ser Phe Val lie o ro 9=C Cat aCc «t gec gtt ccg tat 

590 *H Ile As P His He His Val Val Pro Tyr 



1824 



tea tgg teg acg cgc gat caa atr 

Ser Trp Leu Thr Arg Asp Gin He j£? f£* ? 39 t3C gt 9 at 9 =aa 

SOS gl J iie A1 = Thr Val Lys Tyr Val Met Gin 



615 620 



cac ccc gaa gtt cgt ccq qat at* 

His Pro Glu Val Arg Pro Asp vll S q" f* 9 CC3 999 Cat gaa 9 a 9 

625 AS P Trp ser Lys Pro Gly His Glu Glu 

£ 1 n 



630 635 



tea ggt teg gtc att cca aat- att 

Ser Gly Ser Val lie Pro Asn Val Ctt 9 * C aaa CgC 9 ct 99t 

640 Asn Val Thr Pro Leu Asp Lys Arg Ala Gly 

645 650 



atg cca aac tgg caa att ate cat tct 

Met Pro Asn Trp Gin lie Ils His 111 ?f 9 ? 3 933 9Ct caa aaa 9== 

655 S jf" Ser Ala Glu Glu Val Gin Lys Ala 

660 665 



eta gca gaa ggt cgt ttt oca 9ra 

- «; «u 01y » rg Phe £ Th « - - »c £ .« « 0 

680 

Arg Lp Val ^eu Ala Lys J" Thr Ph« 333 9at 99° Ccc ttt 

685 6 l Q G1U Thr Phe v al Trp Lys Asp Gly Ser Phe 



695 700 



gat eta tec caa get gag tgq caa r- 3 

Asp Leu Ser Gin Ala Glu T^p Gl n Gin ^a ^ ^ 9 tt9 9C3 ^ 

72 0 LSU LeU Ala L V S 



710 715 



725 

^ 730 



1872 



775 780 



eta gat aga gca acc eta gaa qat r»r 

Leu Asp Arg Ala Thr Leu Glu Lp" Has Til " 3 9 " Caa aaa 

785 ZZ ASn Gln Leu Ala Gin Lys 



1920 



790 795 



get aat ate gat cct aag tat etc art- m-. 

Ala Asn lie Asp Pro Ly J Tyr ^ ^ Jtc caa cca gaa ggt gtc caa 

800 ^ Phe Gln Pro Glu Gly Val Gln 

805 810 



1968 



2016 



2064 



2112 



age ate cca aga gca gat g gc aat tr, t-t- 

Ser lie Pro Arg Ala Asp % Ser Ser Lu S Thr 11 ' ^ *** tCt 2160 

705 Z? 9 Tiir Ile Asn Lys Ser 



2208 



Lys Asn Thr I? J j£p ^ J£ ^ |« aaa ccc aaa gaa aag caa 2 25< 

735 ' in ' AS P L y s Pro Lys Glu Lys Gln 

745 

cag gca gat aag age aat aaa 

Gln Ala Asp Lys Ser Asn Glu Asn Gln Gln p" 933 9CC * 9t aaa 23 °* 

750 ASn Gln Gln Pro Ser Glu Ala Ser Lys 

gaa gaa aaa gaa tea qat qac ttt =>>- = 

Glu Glu Lys Glu Ser Lp Lp Phe Lo " a CCa *ac tat ggt 235 2 

7 65 7? £ e ne L ' e As P Ser Leu P ro Asp Tyr Gly 



2400 



2448 
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ttt tat aac aaa aat ggt gaa ttg gta act tat gat ate aag aca ctt 
Phe Tyr Asn Lys Asn Glv Glu Leu Val Thr Tyr Asp lie Lys Thr 

815 820 
caa caa ata aac cct taaccaaaag aagatctcat tgttaaagca ctgctttgtc 
Gin Gin lie Asn Pro 
830 

aaagcaagtt 
aatagcggtt 
gattcttgga 
taaaataatt 
aattgttata 
gataatgttt 
cattatcata 
atagtgagac 
tggtcgttta 
atatttgtta 
tttaggaegg 
attgagegga 
gagccttggt 
aagggtgtct 
teggttatag 
ggtaatatct 
attatctgtt 
aagccagcca 
gatttttttg 
gagctgttca 
caaeggtaat 
ttgtttttat 
gcctcgatac 
ctccaagcct 
ctegtttteg 
ctccataata 



2496 



2551 



ttcccacgaa 
ucaacaggag 
attatttttc 
gttccttcaa 
aagtgttttt 
taccatgaat 
cttccagcaa 
tttcagctac 
aggcttccaa 
acatcttgac 
cggtcttgtg 
ggcaaacata 
gtagtggtct 
tgectctaga 
teataatgeg 
tgatttttct 
caaactccgt 
tcagccagcg 
tttttgtgcc 
<210 



acggtgattt tgaagtcatt ctatgtaacg agtagtgata aaa 9"ggat 
ttcttttgea aagaaatggt atccatgtta gaatagtaaa aaaagaggag 
ctaatgtcaa ataagtagac agaaaactgt 9 ttatttt *"^S„ ttqt 
ttcttctttc tgattagggg ttagtcctag attagcegta J^gggttgt 
aaaattctca atgtattcaa agcagtctaa ttgaacctgt "gatatttt 
teggttgatt tgtctatget ttaaatactt gaaaaatget ^cagttaegg 
aaqatatcca ggattagaaa aagaatgeat gatattggca ctgcacccta 
gSgaaaaa cacttttaggcaatcagtt ^"^""^^"^^^ 
atctctgttg aattctagtt tcattataaa atgtaatgta 
tactatcttt gttgtatttt ctcctattat ggaaataaaa ^"tcagtc 
.gtgaaacca ttcaatacag gcattatctg caggtgttcc 9 
taatgtcttt ttccgtgcaa gcctggtagt aagecataga agtatacact 
cactgtgtaa gattgetect ttatttaggcaatt ttaactgatt 
agtacaaaat ccgtgtcctg acaatctgag atagtgtaag ^tataatttc 
agattcataa ttgatgagag atacaattta cagt-taccga aa "^a 
gttacgagct tttccttagg ettateggea tggaaatccc 9""aattt 
aaataataag ctttacccaa attgggaact ttcttggtac ^gtccgaca 
ttatttttca tgatacgata gactttcttt gtattaacag tcaa^gtg 
aocaatcgtg taatggtacg atagecataa ataaagtgat "tccataca 
attaattcaa taaggtcatc tttttttgcg gcttctcata ctcctttttc 
aggtcgaccg cttgacctta aaacagtcta gaatgaaaac "tegggtag 
agtcttccac aagcttgata agacttactttatcgatt ^ttatcaa 
ttttttaaga ggtcaacctg taattgtaat tgttccactt "gacagatg 
ttacegtagg tatattgett gccaacacct tgatgaaaac gataaagctc 
taccatttca tccaag?ata gatttgacta ttatttttga tgccraaagt 
actctgttag acttgcctgc tttcttcata tegatgeaag «agcttagt 
tatgettttt taaccataat aaaacattcc tgtttctagt "actaaatt 
egcttttett ttgtctcatt ttagggattc agtgcctatt Sttgtcatca 
taaattcccc ggacttaaat tgtgaccctt ggtcggaatg aaagagaagt 
tctttctttt attaagtgaa aaggcaacac ttttctgtac aacatttata 
ctaggcaattaatc ttttagtcat tggtgtttgg ta 9"9agac 
gcgltggtaa ttccaccaat gaacatagtc tttagtctta agagctagtt 
?4La|gtt tcttgataaa caaattcaat tttgaaagca egataegtae 
ggcattgtca taaggataac cagcctgact aagcgaacgt ^gattccaa 
tatttcatca attaactgat tatcaaactc tttgecaega "tgaatgga 
tttggtcagg gcgtaaggga tgctttgtat ggcttgetta «gagttcag 
ccaaccaaa gaeaggcega tgatttcacg gttgtatagg "aatgatga 
agcccaacga ttgcctacac gaacataggt taagtcagtg "taaggctt 
ttcttgetta aattgcctgt ctaagtggtt gggaataggg ^ttcattct 
atgtggtttg aaggtggctt tctgataaac agaaaccaaa "gagtcget 
tcgaatccga cgacgtgaaa gtgtgatacc ttcgttattc "gcatattt 
ggatcegtat ctagactcgc tatcgagaaa aattctttta ^"tctt 
ttcagatact gactccacgg cttgatagta ataacttgag ^ggcatat 
acacatcttt gaaatgctgt atttatcctt attagcagtg autatttccc 
ataatcaccg ctgettgett taggatatct aatt 



2611 
2671 
2726 
2786 
2846 
2906 
2966 
3025 
3085 
3145 
3205 
3265 
3319 
3379 
3439 
3499 
3559 
3619 
3679 
3739 
3799 
3857 
3917 
3977 
4037 
4097 
4157 
4217 
4277 
4337 
4391 
4451 
4511 
4571 
4631 
4691 
4751 
4811 
4871 
4931 
4991 
5051 
5111 
5171 
5215 



<210> 14



<211> 40 
<212> PRT 
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<213> Streptococcus 
<400> 14 

..r »1. L.„ ,. t Thr val „ lu T>1 Lys Mu n< s ^ 

M " As ° :1 * I? Tyr - ;» «. «. Thr £ Tyr 

Ser Tyr Trp Lys Leu Pro Thr Trp " 30 

40 



35 



<210> 15 
<211> 793
<212> PRT 

<213> Streptococcus 



<400> 15 

Met Gly Leu Ala 
1 

Lys Gly Lys Ala 

20 

Ser Ala Glu Glu 
35 

Asp Gin Gly Tyr 
50 

Gly Lys Val Pro 
65 

Asp Pro Asn Tyr 

Asp Gly Tyr Val 

100 

Pro Gly Ser Lys 
115 

Gin Val Ala Lys 
130 

Val Ala His Leu 
145 

Arg Gin Gly Arg 



Thr Lys Asp Asn 
5 

Lys Ala Pro Lys 

Gly He Ser Ala 

40 

Val Thr Ser His 
55 

Tyr Asp Ala He 
70 

Arg Phe Lys Gin 
85 

He Lys Val Asn 



Asp He He Asp 

180 

His Tyr His Tyr 
195 

Ala Ala Gin Ala 
210 

Ser Asp Tyr Arg 
225 

Pro Asp Val Thr 

Gly Tyr His Pro 

260 

His Gin Arg Asp 
275 

Gin Leu His Arg 
290 

Leu He Phe Glu 
305 



Arg Lys Asn He 

120 

Gly Thr Lys Glu 
135 

Ser Lys Glu Glu 
150 

Tyr Thr Thr Asd 
165 

Asp Leu Gly Asp 

He Pro Lys Lys 

200 

Tyr Trp Ser Gin 
215 

Pro Thr Pro Ala 
230 

Pro Asn Pro Gly 
245 

Ala Pro Pro Arg 



Gin He 
10 

Thr Asn 
25 

Glu Gin 

Gly Asp 

He Ser 

Ser Asp 
90 

Gly Asn 
105 

Arg Thr 



Ala Tyr He 

Lys Thr Met 

He Val Val 
45 

His Tyr His 
60 

Glu Glu Leu 
75 

Val li e Asn 
Tyr Tyr Val 



Ala Lys 

Val Ala 

Asp Gly 
170 
Ala Tyr 
185 

Asp Leu 
Lys Gin 
Pro Gly 



Lys Gin Gin 
125 

Glu Lys Gly 
140 

Ala Val Asn 
155 

Tyr He Phe 



Asp Asp Ser 
15 

Asp Gin He 

30 

Lys He Thr 

Phe Tyr Asn 

Leu Met Thr 
80 

Glu lie Leu 
95 

Tyr Leu Lys 
110 

lie Ala Glu 
Leu Ala Gin 



Glu Phe Lys Gly 

280 

Leu Asp Leu Lys 
295 

Pro Thr Gin Val 
310 



Gin Gly 
250 
Pro Asn 
265 

Lys Thr 
Tyr Arg 
He Lvs 



Leu Val Pro 

Ser Pro Ser 
205 

Gly Arg Gly 

220 
Arg Arg Lys 
235 

His Gin Pro 

Asp Ala Ser 

Phe Lys Glu 
285 

His Val Glu 

300 
Ser Asn Ala 
315 



Glu Ala Lys 
160 

Ser Pro Thr 
175 

His Gly Asn 
190 

Glu Leu Ala 

Ala Arg Pro 

Ala Pro lie 
240 

Asp Asn Gly 

255 
Gin Asn Lys 
270 

Leu Leu Asp 

Glu Asp Gly 

Phe Gly Tyr 
320 
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val Val Pro His Gly Asp His Tyr His lie He Pro Arg Ser Gin Leu 

Ser Pro Leu Glu Met Glu Leu Ala Asp Arg Tyr Leu Ala Gly Gln Thr

Glu Asp Asn Asp Ser Gly Ser Glu His Ser Lys Pro Ser Asp Lys Glu

Val Thr His Thr Phe Leu Gly His Arg Ile Lys Ala Tyr Gly Lys Gly

Leu Asp Gly Lys Pro Tyr Asp Thr Ser Asp Ala Tyr Val Phe Ser Lys

Glu Ser Ile His Ser Val Asp Lys Ser Gly Val Thr Ala Lys His Gly

Asp His Phe His Tyr Ile Gly Phe Gly Glu Leu Glu Gln Tyr Glu Leu

Asp Glu Val Ala Asn Trp Val Lys Ala Lys Gly Gln Ala Asp Glu Leu

Ala Ala Ala Leu Asp Gln Glu Gln Gly Lys Glu Lys Pro Leu Phe Asp

450 455 460

Thr Lys Lys Val Ser Arg Lys Val Thr Lys Asp Gly Lys Val Gly Tyr

Met Met Pro Lys Asp Gly Lys Asp Tyr Phe Tyr Ala Arg Asp Gln Leu

Asp Leu Thr Gln Ile Ala Phe Ala Glu Gln Glu Leu Met Leu Lys Asp

Lys Lys His Tyr Arg Tyr Asp Ile Val Asp Thr Gly Ile Glu Pro Arg

Leu Ala Val Asp Val Ser Ser Leu Pro Met His Ala Gly Asn Ala Thr

540



Val Val Pro Tyr Ser Trp Leu Thr Arg Asp Gln Ile Ala Thr Val Lys

Tyr Val Met Gln His Pro Glu Val Arg Pro Asp Val Trp Ser Lys Pro

585 590

Gly His Glu Glu Ser Gly Ser Val Ile Pro Asn Val Thr Pro Leu Asp

595 600 605

Lys Arg Ala Gly Met Pro Asn Trp Gln Ile Ile His Ser Ala Glu Glu

615 620
Val Gln Lys Ala Leu Ala Glu Gly Arg Phe Ala Thr Pro Asp Gly Tyr

III Phe Asp Pro Arg Asp Val Leu Ala Lys III Thr Phe Val Trp Lys 

Asp Gly Ser Phe III He Pro Arg Ala Asp Gly Ser Ser Leu Arg Thr 

660 665 
Ile Asn Lys Ser Asp Leu Ser Gln Ala Glu Trp Gln Gln Ala Gln Glu

Leu Leu Ala Lys Lys Asn Thr Gly Asp Ala Thr Asp Thr Asp Lys Pro

695 700
Lys Glu Lys Gln Gln Ala Asp Lys Ser Asn Glu Asn Gln Gln Pro Ser

HI Ala Ser Lys Glu III Lys Glu Ser Asp Asp Phe He Asp Ser Leu 

725 730 73 , 

Pro Asp Tyr Gly Leu Asp Arg Ala Thr Leu Glu Asp His Ile Asn Gln

740 745

Leu Ala Gln Lys Ala Asn Ile Asp Pro Lys Tyr Leu Ile Phe Gln Pro
755 760 765
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Glu Gly Val Gln Phe Tyr Asn Lys Asn Gly Qlu Leu ^ ^ ^ 

lie Lys Thr Leu Gin Gin He Asn Pro 78 ° 

785 790 



<210> 16 
<211> 715 
<212> PRT 

<213> Streptococcus 
<400> 16 



Met Thr Asd Pro Asn Tvr 
i . ro Asn Tyr Arg Phe Lys Gin Ser Asp Val He Asn Glu 

10 

^ °J" ^ V " "» >*' ™ *» Cly A,„ Tyr Tyr ^ ^ 
Leu Lys Pro <n y ser Lys irg , ys 11 ne wr ^ ^ 

ai. «„ „. v.! «. Lys „„ Thr Lys Glu „. Lys « u Lys My ^ 

«. Oln V.1 „. „. £, s „ Ly « „„ 01u y>1 £ ua ^ 

M. Ly= Cln Cly Arg Xyr Thr Thr Asp ^ „ y ^ ^ £ 

^= „r „, „. n . Asp Asp ^ My » p m> ^ %i £ ^ 

Gly Asn His Tyr His Tyr He Pro ill t * 110 

115 7 Lys Lys As P Leu Ser Pro Ser Glu 

Leu Ala Ala Ala Gin Ala Tvr- o 125 

13 0 Trp Ser Gln Gin Gly Arg Gly Ala 

S Pro ser Asp Tyr Arg Pro Thr Pro ^ prQ ^ ^ ^ ^ ^ 

Pro He Pro Asp Val Thr Pro Asn Pro Gly Gln Gly H1 . aln Pro £° 

Asn Glv Gly Tyr His Prn zn a D „ 1? ° 175 

yr Pro Ala Pro Pro Arg Pro Asn Asp Ala Ser Gln 

Asn Lys His Gln Ara A^n pin du« t , 190 

Arg Asp Glu Phe Lys Gly Lys Thr Phe Lys Glu Leu 

Leu Asp Gln Leu His Ara i P „ t , 205 

210 9 215 P " LyS Tyr Ar 9 His Val Glu Glu 

225 " U ^ S; *" Thr Gin Val Xle £.° Ser Asn Ala Phe 

Cly Tyr Val Val Pro His Gly Asp Hls Tyr S Xle Xle Pro Arg Ser 

Gln Leu Ser Pro Leu Glu Mer <-i,, t ~ 255 

Glu Met Glu Leu Ala Asp Arg Tyr Leu Ala Gly 

Gln Thr Glu Asp Asn Asp Ser Gly Ser Gln w e 27 ° 

275 280 LyS Pr ° Ser Asp 

Lys Glu Val Thr His Thr Phe Leu Gly His Ara II. ^ al 

290 2g5 y s Ara Ile Lys Ala Tyr Gly 

Lys Gly Leu Asp Gly Lvs Pro Tvr- a- - 300 

3 05 y Pr ° Tyr As P Thr Ser Asp Ala Tyr Val Phe 

Ser Lys Glu Ser Ile His w= -. * , 315 320 

lie His Ser Va, Asp Lys Ser Gly Val Thr Ala Lys 

His Gly Asp His Phe His Tyr He Glv III ni 335 

340 Y le Giy Phe Q ly Glu Leu Glu Gln Tyr 



345 350 
Glu Leu Asp Glu Val Ala Asn Trp Val Lys Ala Lys Gly Gln Ala Asp
355 360 365

Glu Leu Ala Ala Ala Leu Asp Gln Glu Gln Gly Lys Glu Lys Pro Leu
370 375 380

Phe Asp Thr Lys Lys Val Ser Arg Lys Val Thr Lys Asp Gly Lys Val
385 390 395 400

Gly Tyr Met Met Pro Lys Asp Gly Lys Asp Tyr Phe Tyr Ala Arg Asp
405 410 415

Gln Leu Asp Leu Thr Gln Ile Ala Phe Ala Glu Gln Glu Leu Met Leu
420 425 430

Lys Asp Lys Lys His Tyr Arg Tyr Asp Ile Val Asp Thr Gly Ile Glu
435 440 445

Pro Arg Leu Ala Val Asp Val Ser Ser Leu Pro Met His Ala Gly Asn
450 455 460

Ala Thr Tyr Asp Thr Gly Ser Ser Phe Val Ile Pro His Ile Asp His
465 470 475 480

Ile His Val Val Pro Tyr Ser Trp Leu Thr Arg Asp Gln Ile Ala Thr
485 490 495

Val Lys Tyr Val Met Gln His Pro Glu Val Arg Pro Asp Val Trp Ser
500 505 510

Lys Pro Gly His Glu Glu Ser Gly Ser Val Ile Pro Asn Val Thr Pro
515 520 525

Leu Asp Lys Arg Ala Gly Met Pro Asn Trp Gln Ile Ile His Ser Ala
530 535 540

Glu Glu Val Gln Lys Ala Leu Ala Glu Gly Arg Phe Ala Thr Pro Asp
545 550 555 560

Gly Tyr Ile Phe Asp Pro Arg Asp Val Leu Ala Lys Glu Thr Phe Val
565 570 575

Trp Lys Asp Gly Ser Phe Ser Ile Pro Arg Ala Asp Gly Ser Ser Leu
580 585 590

Arg Thr Ile Asn Lys Ser Asp Leu Ser Gln Ala Glu Trp Gln Gln Ala
595 600 605

Gln Glu Leu Leu Ala Lys Lys Asn Thr Gly Asp Ala Thr Asp Thr Asp
610 615 620

Lys Pro Lys Glu Lys Gln Gln Ala Asp Lys Ser Asn Glu Asn Gln Gln
625 630 635 640

Pro Ser Glu Ala Ser Lys Glu Glu Lys Glu Ser Asp Asp Phe Ile Asp
645 650 655

Ser Leu Pro Asp Tyr Gly Leu Asp Arg Ala Thr Leu Glu Asp His Ile
660 665 670

Asn Gln Leu Ala Gln Lys Ala Asn Ile Asp Pro Lys Tyr Leu Ile Phe
675 680 685

Gln Pro Glu Gly Val Gln Phe Tyr Asn Lys Asn Gly Glu Leu Val Thr
690 695 700

Tyr Asp Ile Lys Thr Leu Gln Gln Ile Asn Pro
705 710 715

<210> 17
<211> 77 
<212> PRT 

<213> Streptococcus 
<400> 17 

Met His Ser Phe Ser Asn Pro Gly Tyr Pro Tyr Asp Asn Ala Val Thr
1 5 10 15

Glu Ala Phe Phe Lys Tyr Leu Lys His Arg Gln Ile Asn Arg Lys His
20 25 30

Tyr Gln Asn Ile Lys Gln Val Gln Leu Asp Cys Phe Glu Tyr Ile Glu
35 40 45

Asn Phe Tyr Asn Asn Tyr Asn Pro His Thr Ala Asn Leu Gly Leu Thr
50 55 60

Pro Asn Gln Lys Glu Glu Asn Tyr Phe Asn Ala Ile Lys
65 70 75

<210> 18 

<211> 86 

<212> PRT 

<213> Streptococcus



<400> 18 

Met Ala Tyr Tyr Gln Ala Cys Thr Glu Lys Asp Ile Ile Arg Ser Met
1 5 10 15

Ser Arg Lys Gly Thr Pro Ala Asp Asn Ala Cys Ile Glu Trp Phe His
20 25 30

Thr Val Leu Lys Thr Glu Thr Phe Tyr Phe His Asn Arg Arg Lys Tyr
35 40 45

Asn Lys Asp Ser Ile Thr Asn Ile Val Lys Asn Tyr Ile Thr Phe Tyr
50 55 60

Asn Glu Thr Arg Ile Gln Gln Arg Leu Asn Asp Gln Ser Pro Val Gln
65 70 75 80

Tyr Arg Lys Leu Ile Ala
85



<210> 19 
<211> 126 
<212> PRT 

<213> Streptococcus



<400> 19 

Met Glu Asn His Phe Ile Tyr Gly Tyr Arg Thr Ile Thr Arg Leu Leu
1 5 10 15

Lys Lys Ile His Gly Leu Thr Val Asn Thr Lys Lys Val Tyr Arg Ile
20 25 30

Met Lys Asn Asn Gly Trp Leu Cys Arg Thr Arg Thr Lys Lys Val Pro
35 40 45

Asn Leu Gly Lys Ala Tyr Tyr Leu Thr Asp Asn Lys Leu Ser Arg Asp
50 55 60

Phe His Ala Asp Lys Pro Lys Glu Lys Leu Val Thr Asp Ile Thr Tyr
65 70 75 80

Leu Tyr Phe Gly Asn Cys Lys Leu Tyr Leu Ser Ser He Met Asn IL 

Tyr Asn Arg Glu Ile Ile Ala Tyr Thr Ile Ser Asp Cys Gln Asp Thr
100 105 110

Asp Phe Val Leu Asp Thr Leu Asn Gln Leu Lys Leu Pro Lys
115 120 125

<210> 20 
<211> 96 
<212> PRT 

<213> Streptococcus
<400> 20

Met Val Lys Lys Ala Tyr Ser Trp Glu Thr Lys Leu Ala Cys Ile Asp
1 5 10 15

Met Lys Lys Ala Gly Lys Ser Asn Arg Val Ile Met Glu Thr Leu Gly
20 25 30

Ile Lys Asn Asn Ser Gln Ile Tyr Thr Trp Met Lys Trp Tyr Glu Asn
35 40 45

Glu Glu Leu Tyr Arg Phe His Gln Gly Val Gly Lys Gln Tyr Thr Tyr
50 55 60

Gly Lys Gly Leu Glu His Leu Ser Glu Val Glu Gln Leu Gln Leu Gln
65 70 75 80

Val Asp Leu Leu Lys Lys Tyr Arg Gly Leu Ile Arg Lys Ser Ile Lys
85 90 95



<210> 21 
<211> 288 
<212> PRT 

<213> streptococcus



<400> 21 
Ile Arg Tyr Pro Lys Ala Ser Ser Gly Asp Tyr Gly Thr Lys Arg Glu
1 5 10 15

Ile Ile Thr Ala Asn Lys Asp Lys Tyr Ser Ile Ser Lys Met Cys Arg
20 25 30

Trp Leu Asn Met Pro His Ser Ser Tyr Tyr Tyr Gln Ala Val Glu Ser
35 40 45

Val Ser Glu Thr Glu Phe Glu Glu Thr Ile Lys Arg Ile Phe Leu Asp
50 55 60

Ser Glu Ser Arg Tyr Gly Ser Arg Lys Ile Lys Ile Cys Leu Asn Asn
65 70 75 80

Glu Gly Ile Thr Leu Ser Arg Arg Arg Ile Arg Arg Ile Met Lys Arg
85 90 95

Leu Asn Leu Val Ser Val Tyr Gln Lys Ala Thr Phe Lys Pro His Ser
100 105 110

Arg Gly Lys Asn Glu Ala Pro Ile Pro Asn His Leu Asp Arg Gln Phe
115 120 125

Lys Gln Glu Arg Pro Leu Gln Ala Leu Val Thr Asp Leu Thr Tyr Val
130 135 140

Arg Val Gly Asn Arg Trp Ala Tyr Val Cys Leu Ile Ile Asp Leu Tyr
145 150 155 160

Asn Arg Glu Ile Ile Gly Leu Ser Leu Gly Trp His Lys Thr Ala Glu
165 170 175

Leu Val Lys Gln Ala Ile Gln Ser Ile Pro Tyr Ala Leu Thr Lys Val
180 185 190

Lys Met Phe His Ser Asp Arg Gly Lys Glu Phe Asp Asn Gln Leu Ile
195 200 205

Asp Glu Ile Leu Glu Ala Phe Gly Ile Thr Arg Ser Leu Ser Gln Ala
210 215 220

Gly Tyr Pro Tyr Asp Asn Ala Val Ala Glu Ser Thr Tyr Arg Ala Phe
225 230 235 240

Lys Ile Glu Phe Val Tyr Gln Glu Thr Phe Gln Leu Leu Glu Glu Leu
245 250 255

Ala Leu Lys Thr Lys Asp Tyr Val His Trp Trp Asn Tyr His Arg Ile
260 265 270

His Gly Ser Leu Asn Tyr Gln Thr Pro Met Thr Lys Arg Leu Ile Ala
275 280 285
<210> 22
<211> 5058 
<212> DNA 

<213> streptococcus 

<220>
<221> CDS
<222> (1)...(663)
<221> CDS
<222> (763)...(1344)
<221> CDS
<222> (1362)...(1739)
<221> CDS
<222> (2266)...(5058)
<400> 22



aat ttg aaa gca gaa tea tct gta gaa gat gag caa tat aca oca aca 
Asn Leu Lys Ala Glu Leu Ser val Glu Asp IlS Gin Tyr T hr 111 



10 15 



?r tCt 9Ct Cat 9gt tca aca c " caa gaa got att aat 

Val Tyr Gly Lys Ser Ala His Gly Ser Thr Pro Gin Ilu ??y Tsl 

20 2 5 30 

ss in m i" s m it: - s: s s; <» - s i" sr y 



40 45 



Pro lit 111 1,1 Ph C r" 3 9 " 9CC aSC att att cac 3" 9« 

Pro Ala Arg Ala Phe Leu Asp Val Thr Ala Asn He He His Glu Asp 



55 60 



Phf ??* Ctt 993 9ta 9Ct tat 9 aa Sat gac tgt atg gga 

Phe Ser Gly Glu Lys Leu Gly Val Ala Tyr Glu Asp Asp Cys Me^ lly 

cca ttg age atg aat gca ggt gtc ttc cag ttt gat qaa act aat aat 
Pro Leu Ser Met Asn Ala Gly Val Phe Gin Phe Asp Tin Thr tin ls P 

85 90 95 

gat aat act ate get ctt aat ttc cgt tac cca caa ggg aca gat act 
Asp Asn Thr lie Ala Leu Asn Phe Arg Tyr Pro Gin G?y 5hr ll P III 

100 ios 110 

?s lit r aa ™ c f ag ctt 939 aaa ctt aac ^ gaa aaa gtg 

Lys Thr lie Gin Thr Lys Leu Glu Lys Leu Asn Gly Val Glu Lvs Val 
115 120 125 

act ctt tct gac cat gaa cac aca cca eac tat gta cet atg gac qat 
Thr Leu Ser Asp His Glu Hi. Thr Pro His Tyr Val Pro Me? Hp As P 

^3 



48 



96 



144 



192 



240 



288 



336 



384 



432 



140 
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qaa tea gta tea acc tta eta get gtc tat gaa aag caa act ggt ctt 
llu Leu Sal Ser Thr Leu Leu Ala Val Tyr Glu Lys Gin Thr Gly Leu 
145 150 155 

=aa qqa cat gaa cag gtt att ggt ggt ggg aca ttt ggt cgc tta ctt 
lys G?v His Glu Gin val He Gly Gly Gly Thr Phe Gly Arg Leu Leu 
' 165 170 

aaa cot qqt act gca tac ggt gec atg ttc cca gga gat gaa aac act 
Tin Arg Gly Val Ala Tyr Sly Ala Met Phe Pro Gly Asp Glu Asn Thr 

180 185 190 

atg cat caa get aat gag tac atg cct tta gaa aat att ttc cgt teg 
Met His Gin Ala Asn Glu Tyr Met Pro Leu Glu Asn He Phe Arg Ser 
195 200 205 

get get ate tac gca gaa get ate tat gaa tta ate aaa taaaataatc 
Ala Ala lie Tyr Ala Glu Ala He Tyr Glu Leu He Lys 
210 215 220 

cttaaactaa ataegtgate aatgataaag ggtggtgaag acatgaaagt gtctttgect 
ettttcataa ggttagattt ggagacttt atg act gac ttg gaa aaa att att 

Met Thr Asp Leu Glu Lys He He 

225 

aaa gca ata aaa agt gat tea cag aat caa aat tat aca gaa aat ggt 
LV s Ala lie Lys Ser Asp Ser Gin Asn Gin Asn Tyr Thr Glu Asn Gly 
230 235 240 245 

att gat cct ttg ttt get get cct aaa aca get agg ate aat att gtt 
He Asd Pro Leu Phe Ala Ala Pro Lys Thr Ala Arg He Asn He Val 

250 255 260 

aac caa gca cct ggt tta aaa act caa gaa gca aga etc tat tgg aaa 
fly Gin Ala Pro Glv Leu Lys Thr Gin Glu Ala Arg Leu Tyr Trp Lys 

265 ' 270 275 

gat aaa net gga gat cgt eta ege cag tgg ctt gga gtt gat gaa gag 
Lp Lys Ser Gly Asp Arg Leu Arg Gin Trp Leu Gly Val Asp Glu Glu 
280 285 290 

aca ttt tae cat tct gga aaa ttt get gtt tta cct tta gat ttt tat 
Thr Phe Tvr His Ser Gly Lys Phe Ala Val Leu Pro Leu Asp Phe Tyr 
295 300 305 

tac cca ggc aaa gga aaa tea gga gat tta ecc cct aga aaa ggt ttt 
Tyr Pro Gly Lys Gly Lys Ser Gly Asp Leu Pro Pro Arg Lys Gly Phe 
310 315 320 

gcg gag aaa tgg eac cct ctt att tta aaa gaa atg cct aat gtt caa 
Ala Glu Lys Trp His Pro Leu He Leu Lys Glu Met Pro Asn V»l Gin 

330 335 340 

ttg aec ttg eta gtt ggt cag tat get cag aaa tat tat ctt gga age 
Leu Thr Leu Leu Val Gly Gin Tyr Ala Gin Lys Tyr Tyr Leu Gly Ser 

345 350 355 



480 



528 



576 



624 



673 



733 
786 



834 



882 



930 



978 



1026 



1074 



1122 



1170 
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m i" s;: l" hi n: s s s; r s e r *- 9 - 

360 ^ U Thr Val L Y S Al * Tyr Lys Asp Tyr 

365 370 

eta ccc aa: tat tta ccc rtn <-n-f- 

Leu Pro Aso Tvr Leu Pro Jeu « CC9 C9 * aat " a aCC 

37 - > r ^ eu Pr ° ^ Val his Pro Ser Pro Arg Asn Gin lie 

tgg eta aag aag aat cca tgg ttt gaa aaa gat eta ate att aar ffa 
Trp ,eu Lys Lys Asn Pro Trp Phe Glu Lys Asp W £ £J £J Leu 

95 400 405 

3" it: e: i:< k £ - ^ «■»■«» 

4 l n Met 

415 

aga gat aat cat eta cac acq tat ttt rrr r^r nr**- «- - 

«, .«p *s» «i. ^ „. Th ? ^ £ £ L" £ £ S S 

425 430 

ttt gag gac tat att aat gqt ttt ars rrrrt- = 

Phe Glu Asp Tyr He Asn Sy Phe Thr g7v Gl" ph T *" S ** 

4^s y Glu Phe Ile T hr Thr Glu 

440 445 

s" £ k s "< "i £ si =- r r c 9tt c " 3 " 

450 ,« 7 Gly Gln As P As P Pro Asp 

4bb 460 

£ sir is i% s: r T at ctt aat cas aaa tat " a 

465 ' ?™ Y S ASp Tyr Leu Asn Gin Lys Tyr Gly 

° 475 480 

A^n A?g Pne £ ^ £ ^ ^ 9 f J" 3" ^ 9« 

i Y U Ile G1 y Phe Lys Asp Arg Glu 

85 490 495 

se" n: iz ^ ti: r ?« aaa 9aa tct 9at tta aaa <*• 

500 yS LyS G1U PhS As P Leu L Y S 

505 510 

ttg tea ate cat cat aat qqt aaa tat- n=t- 

-» ,. r „. Hi » „ i6 is „ «? y g " Y i m £ s; s" si: s; e 

5 2 0 52 5 

ctg aaa gta cca aca aaq qqa act ttt- = 

,eu Lys val Pro Thr Lyf £ y A l" Phe Sr ^ S Leu 

^-^^ 540 



1218 



1266 



1314 



1364 



1412 



1460 



1508 



1556 



1604 



1652 



1700 



1749 



gaatttgeca 
aagttaaact 
ataaagatgt 
gggaatgaaa 
tactctatag 
caaggtctgc 
taaaaaagct 



taggccgtgt 
tagatgtaga 
tatctaaggg 
aactttatcg 
getctgaegg 
taaaggacta 
aggcaatatt 



ggaagegcac 
agatttaaaa 
gttagctttt 
etatgettta 
tcatattcct 
tcaaattgat 
gcttagcttt 



gttttagctc 
ccgtttgaaa 
gaactaaata 
gagatactca 
gaacattttt 
gaaaatcatt 
tttgtaatgc 



actttgatta 
cgcaattgaa 
ccaaatccct 
aacagcttgg 
gttatgaatt 
tgatatgagg 
tattgatagt 



tggttttcgt 1809 

gegcatttte 1869 

ttatctatat 1929 

ttgtaaacaa 1989 

tgatagactt 2049 

aaatttttga 2109 

tttagtgaaa 2169 
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atttcaaaaa aataaagaaa tcatttactt gttgcaagcg cttgcgcaaa ttgttatgat 

tttattaqta acaattcatt aaaaaaggag aatgat atg aaa aga aaa .gac tta 
tttattgg Arg Lys Asp Leu 



545 



-tt qqt gat aaa caa act caa tac acg att aga aag tta agt gtt gga 
Phe Gly Asp Lys Gin Thr Gin Tyr Thr lie Arg Lys Leu Ser Val Gly 
550 555 560 

gta get tea gtt aca aca ggg gta tgt att ttt ctt cat agt cca cag 
Val Ala Ser Val Thr Thr Gly Val Cys He Phe Leu His Ser Pro Gin 
565 570 575 

qta ttt get gaa gaa gta agt gtt cct cct gca act aca gcg att gca 
Val Phe Ala Glu Glu Val Ser Val Ser Pro Ala Thr Thr Ala He Ala 



580 



585 590 595 



qaq tcq aat att aat cag gtt gac aac caa caa tct act aat tta aaa 
Glu Ser £sn He Asn Gin Val Asp Asn Gin Gin Ser Thr Asn Leu Lys 

600 605 610 



qat qac ata aac tea aac tct gag acg gtt gtg aca ccc tea gat atg 
Asp Asp He Asn Ser Asn Ser Glu Thr Val Val Thr Pro Ser Asp Met 

615 620 625 

ccq qat ace aag caa tta gta tea gat gaa act gac act caa aag gga 
Pro Asp Thr Lys Gin Leu Val Ser Asp Glu Thr Asp Thr Gin Lys Gly 
630 635 640 

gtg aca gag ccg gat aag gcg aca age ctg ctt gaa gaa aat aaa ggt 
Val Thr Glu Pro Asp Lys Ala Thr Ser Leu Leu Glu Glu Asn Lys Gly 
645 650 655 

cct gtt tea gat aaa aat acc tta gar tta aaa gta gca cca tct aca 
Pro Val Ser Asp Lys Asn Thr Leu Asp Leu ' Lys Val Ala Pro Ser Thr 
660 665 670 675 

ttg caa aac act ccc gac aaa acc tct caa get ata ggt get cca age 
Leu Gin Asn Thr Pro Asp Lys Thr Ser Gin Ala lie Gly Ala Pro Ser 

680 685 690 

cct acc ttg aaa gta get aat caa get cca egg att gaa aat ggt tac 
Pro Thr Leu Lys Val Ala Asn Gin Ala Pro Arg He Glu Asn Gly Tyr 

695 700 705 

ttt agg eta cat ctt aaa gaa ttg cct caa ggt cat cct gta gaa age 
Phe Arg Leu His Leu Lys Glu Leu Pro Gin Gly His Pro Val Glu Ser 
710 715 720 

act gga ctt tgg ata tgg gga gat gtt gat caa ccg tct agt aat tgg 
Thr Gly Leu Trp He Trp Gly Asp Val Asp Gin Pro Ser Ser Asn Trp 
725 7 ^0 7 35 

cca aat ggt get ate cct atg act gat get aag aaa gat gat tac ggt 
Pro Asn Gly Ala He Pro Met Thr Asp Ala Lys Lys Asp Asp Tyr Gly 
740 7 45 7 50 7 55 
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2283 



2331 



2379 



2427 



2475 



2523 



2571 



2619 



2667 



2715 
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/bb 770 
ttt tta att aat aac aaa qca qqo ars * = t- ^ 

»- l™ u. »,„ *.„ Ly6 l. « s "n s m i?= l" s: s: 

-780 ?85 

£ s s s sg - « s; s s - s: x 

795 800 
tac ggc ata cat act tat caa ccc ctr = « 

Tyr „y .1. „ is Thr Tyt „ ^ «° £ |» »- - £c £ .« 

B1 ° 815 



aac tat ttg agt tec tct agt aac tat gac 
Asn Tyr Leu Ser Ser Ser Ser Asn Tyr Asp 
820 

ttt aaa gat gtt gca acc ccy tea aca act 
Phe Lys Asp Val Ala Thr Xaa Ser Thr Thr 

840 845 

ttt gtg aat caa gga eta tat gga agg tat 
Phe Val Asn Gin Gly Leu Tyr Gly Arg Tyr 

855 860 



cac tta tea gca tgg etc 
His Leu Ser Ala Trp Leu 
830 835 

tgg cca gat ggt agt aat 
Trp Pro Asp Gly Ser Asn 

850 

att gat gta tea eta aaa 
He Asp Val Ser Leu Lys 

865 



act aac gee aaa gag att aat ttt- nt-a =>^,- 

Thr ». w Q1 l n . i? y £ «■ "J «. £ .j. ^ jjj s 

875 880 
gga gat gca gtg aaa gtt caa ccc aac qac tat ~m- t-t-t- 

oiy »p «. v.i Ly , v.i jjj Pro A »„ £ £ SJ lit S K S 

B9 ° 895 
get aac cat aac caa att ttt at* a asa 

Ala Asn His Asn Gin II Phe Va! Lv. I CC * ^ 9 9tt tat 

90 0 one y ASP Lys As P Pro L Y S v al Tyr 

910 915 

As*n "J Pro £ £ £ f? ^ 9 " 9 9at 3CC «* " a 

iyr ryr lie Asp Gin Val Gin Leu Lys Asp Ala Gin Gin 

920 92 * 930 

att gat tta aca agt att caa gca aqt ttt aca xrr = „ - 

ii. -p l« s „ „. G1 „ i la s l £ s s s s s? 

940 945 

gat aaa act gaa att tta aaa <-raa 

*, Thr Glu n . Leu £ "I S »« - f £ JJt J- ... cj. 

955 960 

E E £ £ S £ 5 S 5 K S S E £ 2 2 

970 9?5 



2955 



3003 



3051 



3099 



3147 



3195 



3243 



3291 
tta ata atc aaa ggc gac ttt aat cct aaa caa ggt cat ttc aac ata
Leu Ile Ile Lys Gly Asp Phe Asn Pro Lys Gln Gly His Phe Asn Ile
980 985 990 995

tct tat aat ggt aac aat gtc atg aca agg caa tct tgg gaa ttt aaa
Ser Tyr Asn Gly Asn Asn Val Met Thr Arg Gln Ser Trp Glu Phe Lys
1000 1005 1010

gac caa ctt tat gct tat agt gga aat tta ggt gca gtt ctc aat caa
Asp Gln Leu Tyr Ala Tyr Ser Gly Asn Leu Gly Ala Val Leu Asn Gln
1015 1020 1025

gat ggt tca aaa gtt gaa gcc agc ctc tgg tca ccg agt gct gat agt
Asp Gly Ser Lys Val Glu Ala Ser Leu Trp Ser Pro Ser Ala Asp Ser
1030 1035 1040

gtc act atg att att tat gac aaa gat aac caa aac agg gtt gta gcg
Val Thr Met Ile Ile Tyr Asp Lys Asp Asn Gln Asn Arg Val Val Ala
1045 1050 1055

act acc ccc ctt gtg aaa aat aat aaa ggt gtt tgg cag acg ata ctt
Thr Thr Pro Leu Val Lys Asn Asn Lys Gly Val Trp Gln Thr Ile Leu
1060 1065 1070 1075

gat act aaa tta ggt att aaa aac tat act ggt tac tat tat ctt tac
Asp Thr Lys Leu Gly Ile Lys Asn Tyr Thr Gly Tyr Tyr Tyr Leu Tyr
1080 1085 1090

gaa ata aaa aga ggt aag gat aag gtt aag att tta gat cct tat gca
Glu Ile Lys Arg Gly Lys Asp Lys Val Lys Ile Leu Asp Pro Tyr Ala
1095 1100 1105

aag tca tta gca gag tgg gat agt aat act gtt aat gat gat att aaa
Lys Ser Leu Ala Glu Trp Asp Ser Asn Thr Val Asn Asp Asp Ile Lys
1110 1115 1120

acg gct aaa gca gct ttt gta aat cca agt caa ctt gga cct caa aat
Thr Ala Lys Ala Ala Phe Val Asn Pro Ser Gln Leu Gly Pro Gln Asn
1125 1130 1135

tta agt ttt gct aaa att gct aat ttt aaa gga aga caa gat gct gtt
Leu Ser Phe Ala Lys Ile Ala Asn Phe Lys Gly Arg Gln Asp Ala Val
1140 1145 1150 1155

ata tac gaa gca cat gta aga gac ttc act tct gat cga tct ttg gat
Ile Tyr Glu Ala His Val Arg Asp Phe Thr Ser Asp Arg Ser Leu Asp
1160 1165 1170

gga aaa tta aaa aat caa ttt ggt acc ttt gca gcc ttt tca gag aaa
Gly Lys Leu Lys Asn Gln Phe Gly Thr Phe Ala Ala Phe Ser Glu Lys
1175 1180 1185

cta gat tat tta cag aaa tta gga gtt aca cac att cag ctt tta ccg
Leu Asp Tyr Leu Gln Lys Leu Gly Val Thr His Ile Gln Leu Leu Pro
1190 1195 1200
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gta ttq aat tat ttt tat at-- = = ~ 

Val Leu Ser Tvr Phe Itr vll 1 ^* * t9 93C 333 tCa C * c tca aca 
12 05 ^ ASP LyS Ser Ar 9 Ser Thr 

1*10 1215 

Ila Ser tit f ° *" " C aat t9 9 99 c c « S« cca cac 

Ala Tyr T... Ser Ser Asp Asn Asn Tvr Asn Trp Glv Tvr Asp Pro G1 * 

5 123 ° 1235 

S £ £ K 2 £ S? - - »; - «• ~ •» - s 

240 1245 1250 

Ser 111 1*1 ?? C 9 f 3 "a aaa caa tta ata cat gat att cat aaa 

Ser Ala A.g Ala Glu Leu Lys Gin Leu He His Asp He His Lys 

1255 1260 126 5 

s m tit s: s ;s s £ s s: 

1275 1280 

s £ss s a s; s: s« e - - -° - - ■« 

-l^-od 1290 



4299 



4347 



1295 



gaa gat cgt tca cca aqa aaa ant- t-«-t- 

Glu Asp Glv Ser Pro Arg p^ Jf! 111 cf 

1300 n nr Gly Gly G1 y Ar< 3 Leu Gly Thr 



1305 

ijio 1315 



1345 



1385 1390 



1395 



Thr So v 'f r? C f ? C " tCa 9at 93t att c 9t aat age ttg aaa 

Thr Asp ... val Gly val Phe Ser Asp Asp He Arg Asn Ser Leu Lys 



1400 1405 



1410 



tct ggt ttt cca aat gaa qqt act era nnc t-.-,- 

Ser » y P„ e Asn G 9 lu « y ^ HI HI HI J« •« «t «c cca 

-L4 lb i a n n __ 



1420 1425 



4395 



4443 



4491 



4539 



4587 



4635 



Thr nil f * m ? c 9 " C9t 9 " "9 9" 9 at tcc "t aaa tat ctt 

Thr H,s A.a Met Ser Arg Arg Val Leu Val Asp Ser lie Lys Tyr Leu 

1320 1325 133 0 

aca agt gaa ttt aaa gtt gat qqt *-tc car 1-1-t- a <- 

Thr Ser Glu Phe Lys Val Asd Glv pit tl 11 f 9 3 9 " a 9at 4683 

tjoc P Y Arg Phe As P Met M " Gly Asp 

1340 



s: s ss s iv. s« s: s £ s; - s: e -» 
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caa net tta caa ggt att ttt aaa aat ate aaa gca caa cct ggg aat 4971 
Gin Ser Leu Gin Gly lie Phe Lys Asn lie Lys Ala Gin Pro Gly Asn 



1430 



1435 1440 



ttt qaa gca cat teg cca gga gat gtg gtg cag tat att get gca cat 5019 
Phe Glu Ala Asp Ser Pro Gly Asp Val Val Gin Tyr He Ala Ala His 



1445 



1450 1455 



gat 
Asp 
1460 



aac ctt ace ttg cat gat gtg att gca aaa tea att 
Asn Leu Thr Leu His Asp Val He Ala Lys Ser He 

1465 1470 



5058 



<210> 23 
<211> 221 
<212> PRT 

<213> streptococcus 
<400> 23 



Asn Leu Lys Ala Glu Leu Ser Val Glu Asp Glu Gln Tyr Thr Ala Thr
1 5 10 15

Val Tyr Gly Lys Ser Ala His Gly Ser Thr Pro Gln Glu Gly Val Asn
20 25 30

Gly Ala Thr Tyr Leu Ala Leu Tyr Leu Ser Gln Phe Asp Phe Glu Gly
35 40 45

Pro Ala Arg Ala Phe Leu Asp Val Thr Ala Asn Ile Ile His Glu Asp
50 55 60

Phe Ser Gly Glu Lys Leu Gly Val Ala Tyr Glu Asp Asp Cys Met Gly
65 70 75 80

Pro Leu Ser Met Asn Ala Gly Val Phe Gln Phe Asp Glu Thr Asn Asp
85 90 95

Asp Asn Thr Ile Ala Leu Asn Phe Arg Tyr Pro Gln Gly Thr Asp Ala
100 105 110

Lys Thr Ile Gln Thr Lys Leu Glu Lys Leu Asn Gly Val Glu Lys Val
115 120 125

Thr Leu Ser Asp His Glu His Thr Pro His Tyr Val Pro Met Asp Asp
130 135 140

Glu Leu Val Ser Thr Leu Leu Ala Val Tyr Glu Lys Gln Thr Gly Leu
145 150 155 160

Lys Gly His Glu Gln Val Ile Gly Gly Gly Thr Phe Gly Arg Leu Leu

165 170 175 

Glu Arg Gly Val Ala Tyr Gly Ala Met Phe Pro Gly Asp Glu Asn Thr
180 185 190

Met His Gln Ala Asn Glu Tyr Met Pro Leu Glu Asn Ile Phe Arg Ser
195 200 205

Ala Ala Ile Tyr Ala Glu Ala Ile Tyr Glu Leu Ile Lys
210 215 220



<210> 24 
<211> 194 
<212> PRT 

<213> streptococcus 
<400> 24 
Met Thr Asp Leu Glu Lys Ile Ile Lys Ala Ile Lys Ser Asp Ser Gln
1 5 10 15

Asn Gln Asn Tyr Thr Glu Asn Gly Ile Asp Pro Leu Phe Ala Ala Pro
20 25 30

Lys Thr Ala Arg Ile Asn Ile Val Gly Gln Ala Pro Gly Leu Lys Thr
35 40 45

Gln Glu Ala Arg Leu Tyr Trp Lys Asp Lys Ser Gly Asp Arg Leu Arg
50 55 60

Gln Trp Leu Gly Val Asp Glu Glu Thr Phe Tyr His Ser Gly Lys Phe
65 70 75 80

Ala Val Leu Pro Leu Asp Phe Tyr Tyr Pro Gly Lys Gly Lys Ser Gly
85 90 95

Asp Leu Pro Pro Arg Lys Gly Phe Ala Glu Lys Trp His Pro Leu Ile
100 105 110

Leu Lys Glu Met Pro Asn Val Gln Leu Thr Leu Leu Val Gly Gln Tyr
115 120 125

Ala Gln Lys Tyr Tyr Leu Gly Ser Ser Ala His Lys Asn Leu Thr Glu
130 135 140

Thr Val Lys Ala Tyr Lys Asp Tyr Leu Pro Asp Tyr Leu Pro Leu Val
145 150 155 160

His Pro Ser Pro Arg Asn Gln Ile Trp Leu Lys Lys Asn Pro Trp Phe
165 170 175

Glu Lys Asp Leu Ile Val Asp Leu Gln Lys Ile Val Ala Asp Ile Leu
180 185 190

Lys Asp



<210> 25 
<211> 126 
<212> PRT 

<213> streptococcus 
<400> 25 

Met Arg Asp Asn His Leu His Thr Tyr Phe Ser Tyr Asp Cys Gln Thr
1 5 10 15

Ala Phe Glu Asp Tyr Ile Asn Gly Phe Thr Gly Glu Phe Ile Thr Thr
20 25 30

Glu His Phe Asp Leu Ser Asn Pro Tyr Thr Gly Gln Asp Asp Val Pro
35 40 45

Asp Tyr Ser Ala Tyr Cys Gln Lys Ile Asp Tyr Leu Asn Gln Lys Tyr
50 55 60

Gly Asn Arg Phe Lys Lys Gly Ile Glu Ile Gly Tyr Phe Lys Asp Arg
65 70 75 80

Glu Ser Asp Ile Leu Asp Tyr Leu Lys Asn Lys Glu Phe Asp Leu Lys
85 90 95

Leu Leu Ser Ile His His Asn Gly Arg Tyr Asp Tyr Leu Gln Glu Glu
100 105 110

Ala Leu Lys Val Pro Thr Lys Gly Ala Phe Ser Arg Leu Glu
115 120 125 

<210> 26 
<211> 931 
<212> PRT 

<213> streptococcus 
<400> 26 
Met Lys Arg Lys Asp Leu Phe Gly Asp Lys Gln Thr Gln Tyr Thr Ile
1 5 10 15

Arg Lys Leu Ser Val Gly Val Ala Ser Val Thr Thr Gly Val Cys Ile
20 25 30

Phe Leu His Ser Pro Gln Val Phe Ala Glu Glu Val Ser Val Ser Pro
35 40 45

Ala Thr Thr Ala Ile Ala Glu Ser Asn Ile Asn Gln Val Asp Asn Gln
50 55 60

Gln Ser Thr Asn Leu Lys Asp Asp Ile Asn Ser Asn Ser Glu Thr Val
65 70 75 80

Val Thr Pro Ser Asp Met Pro Asp Thr Lys Gln Leu Val Ser Asp Glu
85 90 95

Thr Asp Thr Gln Lys Gly Val Thr Glu Pro Asp Lys Ala Thr Ser Leu
100 105 110

Leu Glu Glu Asn Lys Gly Pro Val Ser Asp Lys Asn Thr Leu Asp Leu
115 120 125

Lys Val Ala Pro Ser Thr Leu Gln Asn Thr Pro Asp Lys Thr Ser Gln
130 135 140

Ala Ile Gly Ala Pro Ser Pro Thr Leu Lys Val Ala Asn Gln Ala Pro
145 150 155 160

Arg Ile Glu Asn Gly Tyr Phe Arg Leu His Leu Lys Glu Leu Pro Gln
165 170 175

Gly His Pro Val Glu Ser Thr Gly Leu Trp Ile Trp Gly Asp Val Asp
180 185 190

Gln Pro Ser Ser Asn Trp Pro Asn Gly Ala Ile Pro Met Thr Asp Ala
195 200 205

Lys Lys Asp Asp Tyr Gly Tyr Tyr Val Asp Phe Lys Leu Ser Glu Lys
210 215 220

Gln Arg Lys Gln Ile Ser Phe Leu Ile Asn Asn Lys Ala Gly Thr Asn
225 230 235 240

Leu Ser Gly Asp His His Ile Pro Leu Leu Arg Pro Glu Met Asn Gln
245 250 255

Val Trp Ile Asp Glu Lys Tyr Gly Ile His Thr Tyr Gln Pro Leu Lys
260 265 270

Glu Gly Tyr Val Arg Ile Asn Tyr Leu Ser Ser Ser Ser Asn Tyr Asp
275 280 285

His Leu Ser Ala Trp Leu Phe Lys Asp Val Ala Thr Xaa Ser Thr Thr
290 295 300

Trp Pro Asp Gly Ser Asn Phe Val Asn Gln Gly Leu Tyr Gly Arg Tyr
305 310 315 320

Ile Asp Val Ser Leu Lys Thr Asn Ala Lys Glu Ile Gly Phe Leu Ile
325 330 335

Leu Asp Glu Ser Lys Thr Gly Asp Ala Val Lys Val Gln Pro Asn Asp
340 345 350

Tyr Val Phe Arg Asp Leu Ala Asn His Asn Gln Ile Phe Val Lys Asp
355 360 365

Lys Asp Pro Lys Val Tyr Asn Asn Pro Tyr Tyr Ile Asp Gln Val Gln
370 375 380

Leu Lys Asp Ala Gln Gln Ile Asp Leu Thr Ser Ile Gln Ala Ser Phe
385 390 395 400

Thr Thr Leu Asp Gly Val Asp Lys Thr Glu Ile Leu Lys Glu Leu Lys
405 410 415

Val Thr Asp Lys Asn Gln Asn Ala Ile Gln Ile Ser Asp Ile Thr Leu
420 425 430

Asp Thr Ser Lys Ser Leu Leu Ile Ile Lys Gly Asp Phe Asn Pro Lys
435 440 445
Gln Gly His Phe Asn Ile Ser Tyr Asn Gly Asn Asn Val Met Thr Arg
450 455 460

Gln Ser Trp Glu Phe Lys Asp Gln Leu Tyr Ala Tyr Ser Gly Asn Leu
465 470 475 480

Gly Ala Val Leu Asn Gln Asp Gly Ser Lys Val Glu Ala Ser Leu Trp
485 490 495

Ser Pro Ser Ala Asp Ser Val Thr Met Ile Ile Tyr Asp Lys Asp Asn
500 505 510

Gln Asn Arg Val Val Ala Thr Thr Pro Leu Val Lys Asn Asn Lys Gly
515 520 525

Val Trp Gln Thr Ile Leu Asp Thr Lys Leu Gly Ile Lys Asn Tyr Thr
530 535 540

Gly Tyr Tyr Tyr Leu Tyr Glu Ile Lys Arg Gly Lys Asp Lys Val Lys
545 550 555 560

Ile Leu Asp Pro Tyr Ala Lys Ser Leu Ala Glu Trp Asp Ser Asn Thr
565 570 575

Val Asn Asp Asp Ile Lys Thr Ala Lys Ala Ala Phe Val Asn Pro Ser
580 585 590

Gln Leu Gly Pro Gln Asn Leu Ser Phe Ala Lys Ile Ala Asn Phe Lys
595 600 605

Gly Arg Gln Asp Ala Val Ile Tyr Glu Ala His Val Arg Asp Phe Thr
610 615 620

Ser Asp Arg Ser Leu Asp Gly Lys Leu Lys Asn Gln Phe Gly Thr Phe
625 630 635 640

Ala Ala Phe Ser Glu Lys Leu Asp Tyr Leu Gln Lys Leu Gly Val Thr
645 650 655

His Ile Gln Leu Leu Pro Val Leu Ser Tyr Phe Tyr Val Asn Glu Met
660 665 670

Asp Lys Ser Arg Ser Thr Ala Tyr Thr Ser Ser Asp Asn Asn Tyr Asn
675 680 685

Trp Gly Tyr Asp Pro Gln Ser Tyr Phe Ala Leu Ser Gly Met Tyr Ser
690 695 700

Glu Lys Pro Lys Asp Pro Ser Ala Arg Ile Ala Glu Leu Lys Gln Leu
705 710 715 720

Ile His Asp Ile His Lys Arg Gly Met Gly Val Ile Leu Asp Val Val
725 730 735

Tyr Asn His Thr Ala Lys Thr Tyr Leu Phe Glu Asp Ile Glu Pro Asn
740 745 750

Tyr Tyr His Phe Met Asn Glu Asp Gly Ser Pro Arg Glu Ser Phe Gly
755 760 765

Gly Gly Arg Leu Gly Thr Thr His Ala Met Ser Arg Arg Val Leu Val
770 775 780

Asp Ser Ile Lys Tyr Leu Thr Ser Glu Phe Lys Val Asp Gly Phe Arg
785 790 795 800

Phe Asp Met Met Gly Asp His Asp Ala Ala Ala Ile Glu Leu Ala Tyr
805 810 815

Lys Glu Ala Lys Ala Ile Asn Pro Asn Met Ile Met Ile Gly Glu Gly
820 825 830

Trp Arg Thr Phe Gln Gly Asp Gln Gly Gln Pro Val Lys Pro Ala Asp
835 840 845

Gln Asp Trp Met Lys Ser Thr Asp Thr Val Gly Val Phe Ser Asp Asp
850 855 860

Ile Arg Asn Ser Leu Lys Ser Gly Phe Pro Asn Glu Gly Thr Pro Ala
865 870 875 880

Phe Ile Thr Gly Gly Pro Gln Ser Leu Gln Gly Ile Phe Lys Asn Ile
885 890 895
895 
Lys Ala Gln Pro Gly Asn Phe Glu Ala Asp Ser Pro Gly Asp Val Val
900 905 910

Gln Tyr Ile Ala Ala His Asp Asn Leu Thr Leu His Asp Val Ile Ala
915 920 925

Lys Ser Ile
930


<210> 27 

<211> 5607 
<212> DNA 

<213> streptococcus 



<220>
<221> CDS
<222> (2)...(301)

<400> 27

a att caa agt ttg aca gaa ggt caa ctt cgt tct gat atc cct gag ttc 49
Ile Gln Ser Leu Thr Glu Gly Gln Leu Arg Ser Asp Ile Pro Glu Phe
1 5 10 15

cgt gct ggt gat act gta cgt gtt cac gct aaa gtt gtt gaa ggt act 97
Arg Ala Gly Asp Thr Val Arg Val His Ala Lys Val Val Glu Gly Thr
20 25 30

cgc gaa cgt att cag atc ttt gaa ggt gtt gtt atc tca cgt aaa ggt 145
Arg Glu Arg Ile Gln Ile Phe Glu Gly Val Val Ile Ser Arg Lys Gly
35 40 45

caa gga atc tca gaa atg tac aca gta cgt aaa att tct ggt ggt atc 193
Gln Gly Ile Ser Glu Met Tyr Thr Val Arg Lys Ile Ser Gly Gly Ile
50 55 60

ggt gta gag cgt aca ttc cca att cac act cct cgt gtt gat aaa atc 241
Gly Val Glu Arg Thr Phe Pro Ile His Thr Pro Arg Val Asp Lys Ile
65 70 75 80

gaa gtt gtt cgt tat ggt aaa gta cgt cgt gct aaa ctt tac tac tta 289
Glu Val Val Arg Tyr Gly Lys Val Arg Arg Ala Lys Leu Tyr Tyr Leu
85 90 95

cgc gca ttg caa ggtaaagctg cacgtattaa agaaatccgt cgttaatttt 341
Arg Ala Leu Gln
100

gatgatcaga ttttaaaaat gcttggttgt ttgaggatag taactatgtt ttaaaactgg 401
acaaccaaga cgtaaaaaat ctgcctgtgg gcagtttttt tactaggtcc ccttagttca 461
atggatataa caactccctc ctaaggagta attgctggtt cgattccggc aggggacata 521
ttcattgcat gtaaatagcg gtttagagct attttgcccc aaatttctct gattaagttt 581
atcgttccta tctttttgtt cttgtaattg atgtgcgtaa acttctaaag tgatatttaa 641
attctcgtga tctaaaactt gagagatgga aattagatag cttgcaaatg tatgcctgag 701
agagtgcact cgtacctcgc gaccagttat ttttcggata gttttattga ctgcattatt 761
tgaaagtttg tcgaataatc tgtcgttttt attttttgta aattcatgca aaaaaaataa 821
tgtatcattg tcaattggta tatttctgat actacttttg ttttttgttg gcaggtatct 881
ttggttgaaa tgataatccc aagttttatt aattgataaa tatttgttag tgtaatcaat 941
atcattaact gttaaaccta aacattcagc gaagcgcatg ccagttttag cgatgaggta 1001
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S tacgattgat gttgtgattt ttctttacaa atttttatca agcgtaagta 1061 

ttcattggtt tcaagaaatt ttatctctat tnacgcccct tattttttgc rttaacctta 1121 

gtgaataaac aaaaattttt ttctatatat ccctcgtgaa cagccacgga tacgcaggc' llll 

tttacatgta tgttaaaacg ctttactgta tcttgcacat gcgtttgact ataatgStt llll 

atgacttgtt gatatttagc ggaagtaata ttgcaaagta atatatttcc tattaLcgt 

ctc««t» «S«« tC CCaCCCSCCS "gcgtttae ggaaatacgc cattgataL 36 

!'!!!!!!!! gctaaagaac agggcgtcca aggctacctt gatggaaaag gctctcttag 1421 

agatatttgt aaatggtatg atatctcaag tcgctctgtt ctccaaaagt ggacaaaacg „ 

gtatactagt ggtgaagact tgaaagccac tagtagagga tatagccgta tgaaacaagg 1541 

aaggcaagcc acatttgaag aacgtgtaga gattgttaac cacaccattg cccatgggf! Hoi 

agactatcaa gcagctattg agaagtttgg tgtttcctac caacaaattt attcti^gt llll 

gcgtaagctt gagaagaatg gctcacaagg cttggtCgat agacgtgtga aagggt t gga 17 " 

gagtaggcct gatttaaccg agattgagca actttaactc aagattLL aa^ggaSa llll 

acgtaatcgt ctcttagaaa tcgaggttag tttactaaaa aagttagaag acatca^cg llll 

aggaaacaga cggtaagact aggtaagcat ttagcggagt tccaagtaa? caagaaSH llol 

tacgatgagg aatctaatgt gcctattcag gcctratgcc aactcttgaa ggggtctcgt 1961 

" C !fr ta " acaa 9tg 3 ct caatcgtcaa aaaacagatt ttgagacaaa aaKacaaag loll 

ctaatggcta aaatcaagga acttcgtaga ctctacaatg gtatcttagg ttatcgccg? 208^ 

atgacaacat ttattaatcg tcaacttggg acaacttaaa acaagaaacg gattc^ttS -HI 

ttgatgaaca ttctggggat tagttcagtc attcgtcgtg ttagccatgc ttgtac^aaa 22^ 

gctggtgaca gattttacga agaaaatatt cttaatcgtg aatttaca^c caSjc"" 226^ 

ctTaTaalt ^f 9 "" 9 * tgtcacctat cttcaatacg gtctgggagc taaagcttat 2321 

a^JIaat^ lr.tt a tt C t 9tataac 99 C tctattatcg cttatgagat tagtcacaac 2381 

!!!*!" tC ! ac "9 ttat 9 aagaccatta aaaaggggct agagctcaat ccaggagcca 2441 
cacccatcat ccatagcgat tgaggtagtc aatatacttc caaagaatac cgtJftaTca 

tacaacaagc tggtctgacc ttatccatgt cccggattgg caaatgtatt gltaatg^c fsll 

caactgaaag tttctttggg tttttcaaga ctgagtctta ccaccttaag aatac llll 

atc»t»C lllllT^ ^atgcggcac gttatatcga attctacaac acacaacgct 2ell 

" " a a "" "gactcctc tagaattcag gaatcaggtt gcataactta 2741 

itttqc^a tacttgacag ggagccgttc agattgctta acctttctaa 2801 

atttgctaaa atagctacaa gaaaacgagc catttaatgc ttatttctta tactgtcttq 2861 

ItlZT^l " CCtC9acca aaaattgagc gtgaggcttt ctgttrcatt aaacgltga"? 292^ 

atttccatat tcaccagttt gttttccgag agccatcaaa gcctcgataa ggtcgataat 298^ 

Saaa a ?t a a ! 9gtaataC taaaaa ^ atataaaaaa acctglcc.a HttLc^c 3041 

S""" t9C9CtCCaa tgccgcccaa aagaacgtca ataaaacata aactactatg 3101 

ttagcataag actttatttt tacaactgaa tttcatataa atggattaga gtaagggata 3161 

Ttllllllll !«« 9C J Ct ttt9aaaata aaaaaa "aa tataatatgg L..S?tt t llll 

!!"!!!!! C9tttcataa aaggtatgta atctagtatt taggcaacac tattttgtca 3281 

ctggtgtcta gtaacttata gattgataat tttactagta aacgtaattc ttcgctttaa 3341 

S S lT tttat l 9taa9Ctaaa "gggaggtg aacttatgta aaaUagata 3 

IS!"! agtacgggat gattattgaa acagccagta tgcatcataa aatctgtatt 3461 

SotS;;: caat; CC " a aCCagacatC a 9"cattgt ttatcatcgc tacccLagt 3S21 

!!!*""" = aata 9 a 9 ca ttaggtagtt tttgataata aaactatata aacatgagaa 3581 

itacoaaat? ta " 9CattC "cataatga gttatttgag attttccttt gaataaaLg 3s" 

laaaaatrtJ !!L "! atatataaac 9 3 ctctatca ttgagatagt ttgtcaaatf 37 01 

ttltltrllt " at " aaat ^ttttaaaa acattagttg taggcgatgt aaaaatatta 3761 

JaJtactf?! oI!! a ! a9tt 9C99a9taaa aatagagagg agtaattagg aagtgataaa 3821 

aaatgctata gcatatatta ccagaaaaaa aaatagaaca cttattatat ttgctatttt 3881 

T.llTaltl tl Z lT SCt t9tattCat9 "taacaata atgaaatcaa gtaatgaaac llll 
agaaaaggct ttatatgaaa gttctaattc ttcaatatca attacaaaaa aagatggtaa 4 001 

t C " "" at !! at ttaa 9 aatat tgaaaaaata aaagaggttg aagaaaaaat 4061 

atttcaatat gatggattag caaaattgaa agatcttaaa gtagttagtg gtgagcaaag 4121 

™ aa S a 9aa9atttat "gacgaatt taaaaatgtt gtttcacta| LJcf.c.ag till 

" ! 9 ! aa ! Cttt tattta 9tag tggagtattt agttttaaag aaggaaaaaa 4241 

actJaaa?!! llr* **** attCaaCCct ^gttcatgaa gaatttgcta aacaaaacaa 4301 

actaaaattg ggcgacgaaa ttgatcttga attactagat acggaaaaaa gtggaaaaat 4361 
aaaaagtcat aaatttaaaa ttataggaat cttttctggt aaaaaacagg aaacatatac 4421
aSaatnatca tctgatttta gcgaaaatat ggtttttgta gattattcaa ctagccaaga 4481
SSttlaat aaatcagaga ataatagaat tgcaaataaa attttaatgt attctggtag 4541
tt-aaaatct acagagcttg ccttaaacaa attgaaagac tttaaaattg ataagtcaaa 4601
gt^tctatt aaglalgata ataaagcatt cgaagagtct -agagtcag tgagtggaat 4661
aaaacatata attaaaataa tgacttattc gattatgtta ggtggaatag "gttctttc 4721
attaatcttg attctatggt taagagaaag aatttatgia * ta ^"tat "ttatctat 4781
tggaacaact aagatacaaa ttataaggca atttatattt gagttaatat tcatatcaat 4841
accaagtata atatcctcct tatttttagg gaatctacta ttaaaagtaa ttgtagaagg 4901
atttattaac tcagagaact caatgatttt cggtggaagt ttaataaata a ^cagttt 4961
tatgttaaac ataacaacac ttgcagaaag ttatttaata ttaataagta "attgtttt 5021
atcagttgta atggcctctt cattaatatt atttaagaaa ccacaagaaa tattatcaaa 5081
aataagttag gagcaaataa tggatatatt agaaataaag aatgtaaatt acagttacgc 5141
aaattctaaa gaaaaagttt tgtcaggagt aaatcaaaaa tttgaacttg 9««gtttta 5201
tgcgatagta gggaagtcag gaacaggaaa atccacactt ctttccttac ttgcaggact 5261
SSaaJtt clLclggaa aaatcttgtt taagaatgaa gatatagaaa agaaaggata 5321
tagtaatcac agaaaaaata atatatcttt ggtatttcaa aattataatt ^aatagatta 5381
tttatcgccg atcgaaaata ttagactagt aaataaatca gtagatgaga gtatcttgtt 5441
caaattaggt ttagataaaa aacaaataaa aagaaatgtt atgaaattat ctggtggtca 5501
gcaacaaagg gtagctattg ctagggcact ggtatcagat gccccaataa tactagctga 5561
tgagcccacc ggcaacctag acagtgttac tgctggagaa ataatt 5607

<210> 28
<211> 111
<212> PRT
<213> streptococcus
<400> 28

Ile Gln Ser Leu Thr Glu Gly Gln Leu Arg Ser Asp Ile Pro Glu Phe
1               5                   10                  15

Arg Ala Gly Asp Thr Val Arg Val His Ala Lys Val Val Glu Gly Thr
            20                  25                  30

Arg Glu Arg Ile Gln Ile Phe Glu Gly Val Val Ile Ser Arg Lys Gly
        35                  40                  45

Gln Gly Ile Ser Glu Met Tyr Thr Val Arg Lys Ile Ser Gly Gly Ile
    50                  55                  60

Gly Val Glu Arg Thr Phe Pro Ile His Thr Pro Arg Val Asp Lys Ile
65                  70                  75                  80

Glu Val Val Arg Tyr Gly Lys Val Arg Arg Ala Lys Leu Tyr Tyr Leu
                85                  90                  95

Arg Ala Leu Gln Gly Lys Ala Ala Arg Ile Lys Glu Ile Arg Arg
            100                 105                 110

<210> 29
<211> 173
<212> PRT
<213> streptococcus
<400> 29

Met Arg Phe Ala Glu Cys Leu Gly Leu Thr Val Asn Asp Ile Asp Tyr
1               5                   10                  15

Thr Asn Lys Tyr Leu Ser Ile Asn Lys Thr Trp Asp Tyr His Phe Asn
            20                  25                  30

Gln Arg Tyr Leu Pro Thr Lys Asn Lys Ser Ser Ile Arg Asn Ile Pro
        35                  40                  45

Ile Asp Asn Asp Thr Leu Phe Phe Leu His Glu Phe Thr Lys Asn Lys
    50                  55                  60

Asn Asp Arg Leu Phe Asp Lys Leu Ser Asn Asn Ala Val Asn Lys Thr
65                  70                  75                  80

Ile Arg Lys Ile Thr Gly Arg Glu Val Arg Val His Ser Leu Arg His
                85                  90                  95

Thr Phe Ala Ser Tyr Leu Ile Ser Ile Ser Gln Val Leu Asp His Glu
            100                 105                 110

Asn Leu Asn Ile Thr Leu Glu Val Tyr Ala His Gln Leu Gln Glu Gln
        115                 120                 125

Lys Asp Arg Asn Asp Lys Leu Asn Gln Arg Asn Leu Gly Gln Asn Ser
    130                 135                 140

Ser Lys Pro Leu Phe Thr Cys Asn Glu Tyr Val Pro Cys Arg Asn Arg
145                 150                 155                 160

Thr Ser Asn Tyr Ser Leu Gly Gly Ser Cys Tyr Ile His
                165                 170



<210> 30 
<211> 389 
<212> PRT 

<213> streptococcus 



<400> 30 

Met Lys Ser Ser Asn Glu Ile Glu Lys Ala Leu Tyr Glu Ser Ser Asn
1               5                   10                  15

Ser Ser Ile Ser Ile Thr Lys Lys Asp Gly Lys Tyr Phe Asn Ile Asn
            20                  25                  30

Gln Phe Lys Asn Ile Glu Lys Ile Lys Glu Val Glu Glu Lys Ile Phe
        35                  40                  45

Gln Tyr Asp Gly Leu Ala Lys Leu Lys Asp Leu Lys Val Val Ser Gly
    50                  55                  60

Glu Gln Ser Ile Asn Arg Glu Asp Leu Ser Asp Glu Phe Lys Asn Val
65                  70                  75                  80

Val Ser Leu Glu Ala Thr Ser Asn Thr Lys Arg Asn Leu Leu Phe Ser
                85                  90                  95

Ser Gly Val Phe Ser Phe Lys Glu Gly Lys Asn Ile Glu Glu Asn Asp
            100                 105                 110

Lys Asn Ser Ile Leu Val His Glu Glu Phe Ala Lys Gln Asn Lys Leu
        115                 120                 125

Lys Leu Gly Asp Glu Ile Asp Leu Glu Leu Leu Asp Thr Glu Lys Ser
    130                 135                 140

Gly Lys Ile Lys Ser His Lys Phe Lys Ile Ile Gly Ile Phe Ser Gly
145                 150                 155                 160

Lys Lys Gln Glu Thr Tyr Thr Gly Leu Ser Ser Asp Phe Ser Glu Asn
                165                 170                 175

Met Val Phe Val Asp Tyr Ser Thr Ser Gln Glu Ile Leu Asn Lys Ser
            180                 185                 190

Glu Asn Asn Arg Ile Ala Asn Lys Ile Leu Met Tyr Ser Gly Ser Leu
        195                 200                 205

Glu Ser Thr Glu Leu Ala Leu Asn Lys Leu Lys Asp Phe Lys Ile Asp
    210                 215                 220

Lys Ser Lys Tyr Ser Ile Lys Lys Asp Asn Lys Ala Phe Glu Glu Ser
225                 230                 235                 240

Leu Glu Ser Val Ser Gly Ile Lys His Ile Ile Lys Ile Met Thr Tyr
                245                 250                 255

Ser Ile Met Leu Gly Gly Ile Val Val Leu Ser Leu Ile Leu Ile Leu
            260                 265                 270

Trp Leu Arg Glu Arg Ile Tyr Glu Ile Gly Ile Phe Leu Ser Ile Gly
        275                 280                 285

Thr Thr Lys Ile Gln Ile Ile Arg Gln Phe Ile Phe Glu Leu Ile Phe
    290                 295                 300

Ile Ser Ile Pro Ser Ile Ile Ser Ser Leu Phe Leu Gly Asn Leu Leu
305                 310                 315                 320

Leu Lys Val Ile Val Glu Gly Phe Ile Asn Ser Glu Asn Ser Met Ile
                325                 330                 335

Phe Gly Gly Ser Leu Ile Asn Lys Ser Ser Phe Met Leu Asn Ile Thr
            340                 345                 350

Thr Leu Ala Glu Ser Tyr Leu Ile Leu Ile Ser Ile Ile Val Leu Ser
        355                 360                 365

Val Val Met Ala Ser Ser Leu Ile Leu Phe Lys Lys Pro Gln Glu Ile
    370                 375                 380

Leu Ser Lys Ile Ser
385

<210> 31 
<211> 169 
<212> PRT 

<213> streptococcus 
<400> 31 

Met Asp Ile Leu Glu Ile Lys Asn Val Asn Tyr Ser Tyr Ala Asn Ser
1               5                   10                  15

Lys Glu Lys Val Leu Ser Gly Val Asn Gln Lys Phe Glu Leu Gly Lys
            20                  25                  30

Phe Tyr Ala Ile Val Gly Lys Ser Gly Thr Gly Lys Ser Thr Leu Leu
        35                  40                  45

Ser Leu Leu Ala Gly Leu Asp Lys Val Gln Thr Gly Lys Ile Leu Phe
    50                  55                  60

Lys Asn Glu Asp Ile Glu Lys Lys Gly Tyr Ser Asn His Arg Lys Asn
65                  70                  75                  80

Asn Ile Ser Leu Val Phe Gln Asn Tyr Asn Leu Ile Asp Tyr Leu Ser
                85                  90                  95

Pro Ile Glu Asn Ile Arg Leu Val Asn Lys Ser Val Asp Glu Ser Ile
            100                 105                 110

Leu Phe Glu Leu Gly Leu Asp Lys Lys Gln Ile Lys Arg Asn Val Met
        115                 120                 125

Lys Leu Ser Gly Gly Gln Gln Gln Arg Val Ala Ile Ala Arg Ala Leu
    130                 135                 140

Val Ser Asp Ala Pro Ile Ile Leu Ala Asp Glu Pro Thr Gly Asn Leu
145                 150                 155                 160

Asp Ser Val Thr Ala Gly Glu Ile Ile
                165

<210> 32 
<211> 4171 
<212> DNA

<213> Streptococcus 
<400> 32 

catatgacaa tatttttcaa agtctacatc acttactcgc ctgtcgtgga aaatctggca 60
atacattaat cgaccaatta gttgctgatg gtttacttca tgcagataat cactaccatt 120
ttttcaatgg gaagtctctg gccactttca atactaacca attgattcgc gaagttgtct 180
atgttgaaat atccttagat actatgtcta gtggtgaaca tgatttagta aaagttaaca 240
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300 
360 
420 
480 



a«q"Cn « " catacta *" ccacgargat gacagctagc ccctatcacc 
ttn««qec Tallltttl 9Ca9aCCaaa ".c.t.cc. aatggagggt gegctagcag 
c^ciSatt ca *9ttgaca caaaaccatt taaagaagaa gtaaaacarc 

rcaaS^ca JeSSS^ CCt9CaaCtg "agcttc.c acacactgac agttatagtc „„ 

.caatgacta tcttctttct cgtggttttg ctaatatata cgtttcaggt gtqqotacto 540 

ctggctctac gggttccatg accagtgggg attaccaaca alt-.r.^ltl g ^ 9ggtactg 540 

t _ 3r ^ = saaa attaccaaca aatacaaagc tttaaagcag 600 

tcattgactg gttaaatggt aaggttactg cattcacaag tcataaacga gataaacaaq sso 

tcaaggctga trggtcaaac ggccttgtag caaccacagg taaatcttat ctcggtaccf 720 

cgtcaactgg tttagcaaca actggcgttg aggggctgaa agtcattatc qct S."™ II. 



- 1 "--^"^) aaaaiyyytc ug ug cgnaqt ccaqqcqqrt 

a ^ ! 3 !"" aaCa 9 — ata ctc.cJ.aL ctcSaJS 



840 



=«=«cS .^ssn tfj^s SUSS £££S !8 ° 

esse -^-^ »™* s;. 0 



.. g =, t t g „- 900 
IVASZl" *■•«;«=«, 9 ca« 9 ;c;; 1200 



tltrt 9 r C t aa9t9399at ta «accaat actggcatga ccgtaattac ctaactcacg 
caiaaeaSJ "aaagtcga 9tagtttaca ctcatggact acaggattgg aatgttaagc 
tacf a c a ^o S aCaaagtt "caatgcat tgcctcaaac catcaaaaaa cacctttt?t 
TvtT*,?? ! ataC 9cata a t tggcag t c gatngatttt cgtgaaagca 



960 
1020 
1080 
1140 



1860 
1920 
1980 



accatcaaga gcaaatcggt ttaggtgata gtaaaaaact tattgataac cattatgaca 1380 

Tlltlllll Sr" 0 ^ 3 ' t9taaagact tctgtgtt caaaLtgat cttttcCgg Hlo 

gaaataataa aaccaatcaa atcactatta atcttcctct aaagaaaaat tatctcctqa i nn 

atggacagtg caaactccat ctacgtgtta aaactagtga caaaaaggcc atttScaq lllo 

cccaaatctt agactatggt cctaaaaaac gattcaaaga tacaccILc atcJaJttc? \sll 

taaacagcct tgataatggt aaaaattttg ccagagaagc tttacgtgaa ctcccqttta III* 

ctaaagatca ttatcgtgtc atcagtaaag gtgtcttgaa ccttca^aat cgta««« lllo 

ttlltltllt TJgZltt 9a9CCa9aaC aa tggt tt ga tatcgagttt 
caagtatata tcaattgagt aaaggtgata atctaaggat tatcctttat acaactaatr 
ttgaacatac cactcgagat aatgctagtt actctataac agtagattJg a^ca^ctt 
ttaataattc « C " Ct " t Caaggaaa " aacttatgaa acttcttac? JaagaScggt 
" 9a J 93 ^ C Ccaacac "t tggtaccaga tcaatttatt acaagagagt aacftcggag 2040 
3 ^atgataat aaaaacattc cacaggttgt tgcaac?.?t gttgatjatt Hoi 
tair 9 ^^ 9 3atCatttct 9gtattttgg caatactact Jatacttcca 2160 
actttocacJ cllltt % 3atC9aaaat tctatattca ggttaattta Lggactttg 2220 
iaqctctaH Znttt* ^ 9Ctataaata a "ggaagag tctcctccaa actcaacttj 2280 
aagctctaaa cgatacccta gcaatatttc aataaataag gtagaatgga qtqacaaaqc ">340 

"altqaJS IT^T^ attggaataa ct.fcctgl. ct^aatfa 2 0 

aaqcqqcqcJ ? ctaa "caag aaagaataca ggtcagaaat aaagatattg 2460 

a"cct«t?tc " acagctgcgg gaggaaaaca gctcagacca gcattcttt? 2520 

^acttcttt 9 aataaggaga atcaagatac tcagcaacta aagaaaatcg 2580 

9 ac aaa of I cacgttg cta cattaatcca tgatgatgtc attgatgact 2640 
a = gtggaaat atgaccattc aaagcaagtt tggcaaagac atcgcagttt 

S ltlatttt ZC , aCa9tCtttt tcgatcttat ttcagaaLt atgactla^a 
caccatttat gaggattaat gcaaaatcta tgcgtaaaat tctcatggga gaattqqacc 
agatgcacct tcgttacaat caacaacaag gtatccatca ctattta^t JcglScaq ,« eu 

~ IFF" ~« « SSSS5 =22= 

Jtgcttatc 3 13 ga " aaaaaa cagatatggc taccgaagac atggaaaaal 

ttactSaS SctS , "^gaggtg ttgacaaagc tcgccatcta gctcgtaaat 

tact™ 33 It 9 9acataaata agctacccca gaactctgca aaaaaacagt 

L Jaao 33 ^ 30 " tttaaaac 9«-..tt t . aataataaaa aaacattcca 3360 

aa a cltc;?a 9 f 4 gggaatgt " tcttattatc atttatttat cgcacctatc 3420 

"caS oaqa^f atCagcggCt "cagctgac ggtaacgttg actactttga 3480 

tllatl^ll 9aggagaacc "ccaactct aattgcccat tttctataaa taagatacL 

tcagcatgtt caataccttt taagtgatgt gtaatccaaa ctaaqgtctt accLcaa 



2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 



3540 
3600 
tctttcataa atacccttag taaggcttgt tcagtaatag gatcaagtcc aacagttggc 3660
tcatctaaga taacaattgg gacatctttt agtaagattc tagccaaagc aattctatgc 3720
ctttcgccac ctgaaaacct aagcccagct ccaccaacca ttgtatagag accatctgat 3780
aaatcagtga ccatctcttt caatccaact cgttcaagaa ctttccatac atcttcttca 3840
ctagcatctt ggtttccaat gcgaatgtta tttagcaggg ttgtattaaa aaggtagggc 3900
gcttgttgta tcactccaat atagttagaa atgcaatcac caactattga aacatcagca 3960
ccgcctaggg taatcttccc ttgacttgct ttcaagtcgc cacgaagtag accagctaag 4020
gtactcttgc cagaaccact ccgccctaaa atagcaattt tttctccttc tttaatatcc 4080
aaatctaaat gatgcaaaac ccatttctct tgtggcttat actggaaact taaattcttg 4140
acggaaaaat catatggctt attaggcaat t 4171



<210> 33 

<211> 649 

<212> PRT 

<213> Streptococcus



<400> 33 
Tyr Asp Asn Ile Phe Gln Ser Leu His His Leu Leu Ala Cys Arg Gly
1               5                   10                  15

Lys Ser Gly Asn Thr Leu Ile Asp Gln Leu Val Ala Asp Gly Leu Leu
            20                  25                  30

His Ala Asp Asn His Tyr His Phe Phe Asn Gly Lys Ser Leu Ala Thr
        35                  40                  45

Phe Asn Thr Asn Gln Leu Ile Arg Glu Val Val Tyr Val Glu Ile Ser
    50                  55                  60

Leu Asp Thr Met Ser Ser Gly Glu His Asp Leu Val Lys Val Asn Ile
65                  70                  75                  80

Ile Arg Pro Thr Thr Glu His Thr Ile Pro Thr Met Met Thr Ala Ser
                85                  90                  95

Pro Tyr His Gln Gly Ile Asn Asp Pro Ala Ala Asp Gln Lys Thr Tyr
            100                 105                 110

Gln Met Glu Gly Ala Leu Ala Val Lys Gln Pro Lys His Ile Gln Val
        115                 120                 125

Asp Thr Lys Pro Phe Lys Glu Glu Val Lys His Pro Ser Lys Leu Pro
    130                 135                 140

Ile Ser Pro Ala Thr Glu Ser Phe Thr His Ile Asp Ser Tyr Ser Leu
145                 150                 155                 160

Asn Asp Tyr Phe Leu Ser Arg Gly Phe Ala Asn Ile Tyr Val Ser Gly
                165                 170                 175

Val Gly Thr Ala Gly Ser Thr Gly Phe Met Thr Ser Gly Asp Tyr Gln
            180                 185                 190

Gln Ile Gln Ser Phe Lys Ala Val Ile Asp Trp Leu Asn Gly Lys Val
        195                 200                 205

Thr Ala Phe Thr Ser His Lys Arg Asp Lys Gln Val Lys Ala Asp Trp
    210                 215                 220

Ser Asn Gly Leu Val Ala Thr Thr Gly Lys Ser Tyr Leu Gly Thr Met
225                 230                 235                 240

Ser Thr Gly Leu Ala Thr Thr Gly Val Glu Gly Leu Lys Val Ile Ile
                245                 250                 255

Ala Glu Ala Ala Ile Ser Thr Trp Tyr Asp Tyr Tyr Arg Glu Asn Gly
            260                 265                 270

Leu Val Cys Ser Pro Gly Gly Tyr Pro Gly Glu Asp Leu Asp Val Leu
        275                 280                 285

Thr Glu Leu Thr Tyr Ser Arg Asn Leu Leu Ala Gly Asp Tyr Ile Lys
    290                 295                 300

Asn Asn Asp Cys Tyr Gln Ala Leu Leu Asn Glu Gln Ser Lys Ala Ile
305                 310                 315                 320

Asp Arg Gln Ser Gly Asp Tyr Asn Gln Tyr Trp His Asp Arg Asn Tyr
                325                 330                 335

Leu Thr His Val Asn Asn Val Lys Ser Arg Val Val Tyr Thr His Gly
            340                 345                 350

Leu Gln Asp Trp Asn Val Lys Pro Arg His Val Tyr Lys Val Phe Asn
        355                 360                 365

Ala Leu Pro Gln Thr Ile Lys Lys His Leu Phe Leu His Gln Gly Gln
    370                 375                 380

His Val Tyr Met His Asn Trp Gln Ser Ile Asp Phe Arg Glu Ser Met
385                 390                 395                 400

Asn Ala Leu Leu Ser Gln Glu Leu Leu Gly Ile Asp Asn His Phe Gln
                405                 410                 415

Leu Glu Glu Val Ile Trp Gln Asp Asn Thr Thr Glu Gln Thr Trp Gln
            420                 425                 430

Val Leu Asp Ala Phe Gly Gly Asn His Gln Glu Gln Ile Gly Leu Gly
        435                 440                 445

Asp Ser Lys Lys Leu Ile Asp Asn His Tyr Asp Lys Glu Ala Phe Asp
    450                 455                 460

Thr Tyr Cys Lys Asp Phe Asn Val Phe Lys Asn Asp Leu Phe Lys Gly
465                 470                 475                 480

Asn Asn Lys Thr Asn Gln Ile Thr Ile Asn Leu Pro Leu Lys Lys Asn
                485                 490                 495

Tyr Leu Leu Asn Gly Gln Cys Lys Leu His Leu Arg Val Lys Thr Ser
            500                 505                 510

Asp Lys Lys Ala Ile Leu Ser Ala Gln Ile Leu Asp Tyr Gly Pro Lys
        515                 520                 525

Lys Arg Phe Lys Asp Thr Pro Thr Ile Lys Phe Leu Asn Ser Leu Asp
    530                 535                 540

Asn Gly Lys Asn Phe Ala Arg Glu Ala Leu Arg Glu Leu Pro Phe Thr
545                 550                 555                 560

Lys Asp His Tyr Arg Val Ile Ser Lys Gly Val Leu Asn Leu Gln Asn
                565                 570                 575

Arg Thr Asp Leu Leu Thr Ile Glu Ala Ile Glu Pro Glu Gln Trp Phe
            580                 585                 590

Asp Ile Glu Phe Ser Leu Gln Pro Ser Ile Tyr Gln Leu Ser Lys Gly
        595                 600                 605

Asp Asn Leu Arg Ile Ile Leu Tyr Thr Thr Asp Phe Glu His Thr Ile
    610                 615                 620

Arg Asp Asn Ala Ser Tyr Ser Ile Thr Val Asp Leu Ser Gln Ser Tyr
625                 630                 635                 640

Leu Thr Ile Pro Thr Asn Gln Gly Asn
                645



<210> 34 

<211> 119 

<212> PRT 

<213> Streptococcus



<400> 34 

Met Lys Leu Leu Thr Lys Glu Arg Phe Asp Asp Ser Gln His Phe Trp
1               5                   10                  15

Tyr Gln Ile Asn Leu Leu Gln Glu Ser Asn Phe Gly Ala Val Phe Asp
            20                  25                  30

His Asp Asn Lys Asn Ile Pro Gln Val Val Ala Thr Ile Val Asp Asp
        35                  40                  45

Leu Gln Gly Ser Gly Ser Ser Asn His Phe Trp Tyr Phe Gly Asn Thr
    50                  55                  60

Thr Asp Thr Ser Ile Leu Met Ile Ala His Leu Asn Arg Lys Phe Tyr
65                  70                  75                  80

Ile Gln Val Asn Leu Lys Asp Phe Asp Phe Ala Leu Asn Leu Ile Ala
                85                  90                  95

Ile Asn Asn Trp Lys Ser Leu Leu Gln Thr Gln Leu Glu Ala Leu Asn
            100                 105                 110

Asp Thr Leu Ala Ile Phe Gln
        115



<210> 35 

<211> 326 

<212> PRT 

<213> Streptococcus



<400> 35 
Met Ser Ser Tyr Trp Asn Asn Tyr Pro Glu Leu Lys Lys Asn Ile Asp
1               5                   10                  15

Glu Thr Asn Gln Leu Ile Gln Glu Arg Ile Gln Val Arg Asn Lys Asp
            20                  25                  30

Ile Glu Ala Ala Leu Ser Gln Leu Thr Ala Ala Gly Gly Lys Gln Leu
        35                  40                  45

Arg Pro Ala Phe Phe Tyr Leu Phe Ser Gln Leu Gly Asn Lys Glu Asn
    50                  55                  60

Gln Asp Thr Gln Gln Leu Lys Lys Ile Ala Ala Ser Leu Glu Ile Leu
65                  70                  75                  80

His Val Ala Thr Leu Ile His Asp Asp Val Ile Asp Asp Ser Pro Leu
                85                  90                  95

Arg Arg Gly Asn Met Thr Ile Gln Ser Lys Phe Gly Lys Asp Ile Ala
            100                 105                 110

Val Tyr Thr Gly Asp Leu Leu Phe Thr Val Phe Phe Asp Leu Ile Leu
        115                 120                 125

Glu Ser Met Thr Asp Thr Pro Phe Met Arg Ile Asn Ala Lys Ser Met
    130                 135                 140

Arg Lys Ile Leu Met Gly Glu Leu Asp Gln Met His Leu Arg Tyr Asn
145                 150                 155                 160

Gln Gln Gln Gly Ile His His Tyr Leu Arg Ala Ile Ser Gly Lys Thr
                165                 170                 175

Ala Glu Leu Phe Lys Leu Ala Ser Lys Glu Gly Ala Tyr Phe Gly Gly
            180                 185                 190

Ala Glu Lys Glu Val Val Arg Leu Ala Gly His Ile Gly Phe Asn Ile
        195                 200                 205

Gly Met Thr Phe Gln Ile Leu Asp Asp Ile Leu Asp Tyr Thr Ala Asp
    210                 215                 220

Lys Lys Thr Phe Asn Lys Pro Val Leu Glu Asp Leu Thr Gln Gly Val
225                 230                 235                 240

Tyr Ser Leu Pro Leu Leu Leu Ala Ile Glu Glu Asn Pro Asp Ile Phe
                245                 250                 255

Lys Pro Ile Leu Asp Lys Lys Thr Asp Met Ala Thr Glu Asp Met Glu
            260                 265                 270

Lys Ile Ala Tyr Leu Val Val Ser His Arg Gly Val Asp Lys Ala Arg
        275                 280                 285

His Leu Ala Arg Lys Phe Thr Glu Lys Ala Ile Ser Asp Ile Asn Lys
    290                 295                 300

Leu Pro Gln Asn Ser Ala Lys Lys Gln Leu Leu Gln Leu Thr Asn Tyr
305                 310                 315                 320

Leu Leu Lys Arg Lys Ile
                325

<210> 36 

<211> 247 

<212> PRT 

<213> Streptococcus

<400> 36 

Leu Pro Asn Lys Pro Tyr Asp Phe Ser Val Lys Asn Leu Ser Phe Gln
1               5                   10                  15

Tyr Lys Pro Gln Glu Lys Trp Val Leu His His Leu Asp Leu Asp Ile
            20                  25                  30

Lys Glu Gly Glu Lys Ile Ala Ile Leu Gly Arg Ser Gly Ser Gly Lys
        35                  40                  45

Ser Thr Leu Ala Ser Leu Leu Arg Gly Asp Leu Lys Ala Ser Gln Gly
    50                  55                  60

Lys Ile Thr Leu Gly Gly Ala Asp Val Ser Ile Val Gly Asp Cys Ile
65                  70                  75                  80

Ser Asn Tyr Ile Gly Val Ile Gln Gln Ala Pro Tyr Leu Phe Asn Thr
                85                  90                  95

Thr Leu Leu Asn Asn Ile Arg Ile Gly Asn Gln Asp Ala Ser Glu Glu
            100                 105                 110

Asp Val Trp Lys Val Leu Glu Arg Val Gly Leu Lys Glu Met Val Thr
        115                 120                 125

Asp Leu Ser Asp Gly Leu Tyr Thr Met Val Asp Glu Ala Gly Leu Arg
    130                 135                 140

Phe Ser Gly Gly Glu Arg His Arg Ile Ala Leu Ala Arg Ile Leu Leu
145                 150                 155                 160

Lys Asp Val Pro Ile Val Ile Leu Asp Glu Pro Thr Val Gly Leu Asp
                165                 170                 175

Pro Ile Thr Glu Gln Ala Leu Leu Arg Val Phe Met Lys Glu Leu Glu
            180                 185                 190

Gly Lys Thr Leu Val Trp Ile Thr His His Leu Lys Gly Ile Glu His
        195                 200                 205

Ala Asp Arg Ile Leu Phe Ile Glu Asn Gly Gln Leu Glu Leu Glu Gly
    210                 215                 220

Ser Pro Gln Glu Leu Ser Gln Ser Ser Gln Arg Tyr Arg Gln Leu Lys
225                 230                 235                 240

Ala Ala Asp Asp Gly Asp Leu
                245

<210> 37 
<211> 3480 
<212> DNA 

<213> Streptococcus 
<400> 37 

aattctattt ggaggttttt cttgaataaa tggttagtta aggcaagttc cttagttgtt 60 
ttaggtggta tggttttatc tgcgggttcc cgagttttag cggatactta tgtccgtcca 120 
attgataatg gtagaattac aacaggtttc aatggttatc ctggacattg tggggtggat 180 
tatgctgttc cgactggaac gattattagg gcagtggcag atggtactgt gaaatttgca 240 
ggagctggag ccaacttttc ttggatgaca gacttagcag gaaattgtgt catgattcaa 300 
catgcggatg gaatgcatag tggttacgct catatgtcac gtgtggtggc taggactggg 360 
gaaaaagtca aacaaggaga tatcatcggt tacgtaggag caactggtat ggcgacggga 420 
cctcaccttc attttgaatt tttaccagct aaccctaatt ttcaaaatgg tttccatgga 480
cgtatcaatc caacgtcact aattgctaac gttgcgacct ttagtggaaa aacgcaagca 540
tcagctccaa gcattaagcc attacaatca gctcctgtac agaatcaatc tagtaaatta 600
aaagtgtatc gagtagatga attacaaaag gttaatggtg tttggttagt caaaaataac 660
accctaacgc cgactgggtt tgattggaac gataatggta taccagcatc agaaattgat 720
gaggttgatg ctaatggtaa tttgacagct gaccaggttc ttcaaaaagg tggttacttt 780
atctttaatc ctaaaactct taagactgta gaaaaaccca tccaaggaac agctggttta 840
acttgggcta agacacgctt tgctaatggt agttcagttt ggcttcgcgt tgacaacagt 900
caagaactgc tttacaaata gtttgaggta ttgattcatt gttttaaatg acagttttgt 960
tactaactaa gtacaatttc tttaaaccgt ctgaaaataa ttttatagtc cagtaaagtg 1020
tgatattata gcctcggact aataaaaagg aaataggaat tgaagcaatg aaaatgaata 1080
aaaaggtact attgacatcg acaatggcag cttcgctatt atcagtcgca agtgttcaag 1140
cacaagaaac agatacgacg tggacagcac gtactgtttc agaggtaaag gctgatttgg 1200
taaagcaaga caataaatca tcatatactg tgaaatatgg tgatacacta agcgttattt 1260
cagaagcaat gtcaattgat atgaatgtct tagcaaaaat taataacatt gcagatatca 1320
atcttattta tcctgagaca acactgacag taacttacga tcagaagagt catactgcca 1380
cttcaatgaa aatagaaaca ccagcaacaa atgctgctgg tcaaacaaca gctactgtgg 1440
atttgaaaac caatcaagtt tctgttgcag accaaaaagt ttctctcaat acaatttcgg 1500
aaggtatgac accagaagca gcaacaacga ttgtttcgcc aatgaagaca tattcttctg 1560
cgccagcttt gaaatcaaaa gaagtattag cacaagagca agctgttagt caagcagcag 1620
ctaatgaaca ggtatcaaca gctcctgtga agtcgattac ttcagaagtt ccagcagcta 1680
aagaggaagt taaaccaact cagacgtcag tcagtcagtc aacaacagta tcaccagctt 1740
ctgttgccgc tgaaacacca gctccagtag ctaaagtagc accggtaaga actgtagcag 1800
cccctagagt ggcaagtgtt aaagtagtca ctcctaaagt agaaactggt gcatcaccag 1860
agcatgtatc agctccagca gttcctgtga ctacgacttc aacagctaca gacagtaagt 1920
tacaagcgac tgaagttaag agcgttccgg tagcacaaaa agctccaaca gcaacaccgg 1980
tagcacaacc agcttcaaca acaaatgcag tagctgcaca tcctgaaaat gcagggctcc 2040
aacctcatgt tgcagcttat aaagaaaaag tagcgtcaac ttatggagtt aatgaattca 2100
gtacataccg tgcaggtgat ccaggtgatc atggtaaagg tttagcagtc gactttattg 2160
taggtaaaaa ccaagcactt ggtaatgaag ttgcacagta ctctacacaa aatatggcag 2220
caaataacat ttcatatgtt atctggcaac aaaagtttta ctcaaataca aatagtattt 2280
atggacctgc taatacttgg aatgcaatgc cagatcgtgg tggcgttact gccaaccatt 2340
atgaccatgt tcacgtatca tttaacaaat aatataaaaa aggaagctat ttggcttctt 2400
ttttatatgc cttgaataga ctttcaaggt tcttatctaa tttttattaa attgaggaga 2460
ttaagctata agtctgaaac tactttcacg ttaaccgtga ctaaatcaaa acgttaaaac 2520
taaaatctaa gtctgtaaag attattgaaa acgctttaaa aacagatata ataaggtttg 2580
tagatatcta aaattaaaaa agataaggaa gtgagaatat gccacatcta agtaaagaag 2640
cttttaaaaa gcaaataaaa aatggcatta ttgtgtcatg tcaagctttg cctggggagc 2700
ctctttatac tgaaagtgga ggtgttatgc ctcttttagc tttggcagct caagaagcag 2760
gagcggttgg tataagagcc aatagtgtcc gcgacattaa ggaaattcaa gaagttacta 2820
atttacctat catcggcatt attaaacgtg aatatcctcc acaagaacca tttatcactg 2880
ctacgatgac agaggtggat caattagcta gtttagatat tgcagtaata gccttagatt 2940
gtacacttag agagcgtcat gatggtttga gtgtagctga gtttattcaa aagataaaag 3000
ggaaatatcc tgaacagttg ctaatggctg atataagtac ttttgaagaa ggtaaaaatg 3060
cttttgaagc aggagttgat tttgtgggta caactctatc tggatacaca gattacagcc 3120
gccaagaaga aggaccggat atagaactcc ttaataagct ttgtcaagcc ggtatagatg 3180
tgattgcgga aggtaaaatt catactccta agcaagctaa tgaaattaat catataggtg 3240
ttgcaggaat tgtagttggt ggtgctatca ctagaccaaa agaaatagcg gagcgtttca 3300
tctcaggact tagttaaaag tgttactcaa aaatcaaaat caaaataaaa aaggggaata 3360
gttatgagta tcaaaaaaag tgtgattggt ttttgcctcg gagctgcagc attatcaatg 3420
tttgcttgtg tagacagtag tcaatctgtt atggctgccg agaaggataa agtcgaaatt 3480

<210> 38
<211> 306
<212> PRT
<213> Streptococcus
<400> 38

Asn Ser Ile Trp Arg Phe Phe Leu Asn Lys Trp Leu Val Lys Ala Ser
1               5                   10                  15

Ser Leu Val Val Leu Gly Gly Met Val Leu Ser Ala Gly Ser Arg Val
            20                  25                  30

Leu Ala Asp Thr Tyr Val Arg Pro Ile Asp Asn Gly Arg Ile Thr Thr
        35                  40                  45

Gly Phe Asn Gly Tyr Pro Gly His Cys Gly Val Asp Tyr Ala Val Pro
    50                  55                  60

Thr Gly Thr Ile Ile Arg Ala Val Ala Asp Gly Thr Val Lys Phe Ala
65                  70                  75                  80

Gly Ala Gly Ala Asn Phe Ser Trp Met Thr Asp Leu Ala Gly Asn Cys
                85                  90                  95

Val Met Ile Gln His Ala Asp Gly Met His Ser Gly Tyr Ala His Met
            100                 105                 110

Ser Arg Val Val Ala Arg Thr Gly Glu Lys Val Lys Gln Gly Asp Ile
        115                 120                 125

Ile Gly Tyr Val Gly Ala Thr Gly Met Ala Thr Gly Pro His Leu His
    130                 135                 140

Phe Glu Phe Leu Pro Ala Asn Pro Asn Phe Gln Asn Gly Phe His Gly
145                 150                 155                 160

Arg Ile Asn Pro Thr Ser Leu Ile Ala Asn Val Ala Thr Phe Ser Gly
                165                 170                 175

Lys Thr Gln Ala Ser Ala Pro Ser Ile Lys Pro Leu Gln Ser Ala Pro
            180                 185                 190

Val Gln Asn Gln Ser Ser Lys Leu Lys Val Tyr Arg Val Asp Glu Leu
        195                 200                 205

Gln Lys Val Asn Gly Val Trp Leu Val Lys Asn Asn Thr Leu Thr Pro
    210                 215                 220

Thr Gly Phe Asp Trp Asn Asp Asn Gly Ile Pro Ala Ser Glu Ile Asp
225                 230                 235                 240

Glu Val Asp Ala Asn Gly Asn Leu Thr Ala Asp Gln Val Leu Gln Lys
                245                 250                 255

Gly Gly Tyr Phe Ile Phe Asn Pro Lys Thr Leu Lys Thr Val Glu Lys
            260                 265                 270

Pro Ile Gln Gly Thr Ala Gly Leu Thr Trp Ala Lys Thr Arg Phe Ala
        275                 280                 285

Asn Gly Ser Ser Val Trp Leu Arg Val Asp Asn Ser Gln Glu Leu Leu
    290                 295                 300

Tyr Lys
305

<210> 39 
<211> 434 
<212> PRT 

<213> Streptococcus
<400> 39 

Met Lys Met Asn Lys Lys Val Leu Leu Thr Ser Thr Met Ala Ala Ser
1               5                   10                  15

Leu Leu Ser Val Ala Ser Val Gln Ala Gln Glu Thr Asp Thr Thr Trp
            20                  25                  30

Thr Ala Arg Thr Val Ser Glu Val Lys Ala Asp Leu Val Lys Gln Asp
        35                  40                  45

Asn Lys Ser Ser Tyr Thr Val Lys Tyr Gly Asp Thr Leu Ser Val Ile
    50                  55                  60

Ser Glu Ala Met Ser Ile Asp Met Asn Val Leu Ala Lys Ile Asn Asn
65                  70                  75                  80

Ile Ala Asp Ile Asn Leu Ile Tyr Pro Glu Thr Thr Leu Thr Val Thr
                85                  90                  95

Tyr Asp Gln Lys Ser His Thr Ala Thr Ser Met Lys Ile Glu Thr Pro
            100                 105                 110

Ala Thr Asn Ala Ala Gly Gln Thr Thr Ala Thr Val Asp Leu Lys Thr
        115                 120                 125

Asn Gln Val Ser Val Ala Asp Gln Lys Val Ser Leu Asn Thr Ile Ser
    130                 135                 140

Glu Gly Met Thr Pro Glu Ala Ala Thr Thr Ile Val Ser Pro Met Lys
145                 150                 155                 160

Thr Tyr Ser Ser Ala Pro Ala Leu Lys Ser Lys Glu Val Leu Ala Gln
                165                 170                 175

Glu Gln Ala Val Ser Gln Ala Ala Ala Asn Glu Gln Val Ser Thr Ala
            180                 185                 190

Pro Val Lys Ser Ile Thr Ser Glu Val Pro Ala Ala Lys Glu Glu Val
        195                 200                 205

Lys Pro Thr Gln Thr Ser Val Ser Gln Ser Thr Thr Val Ser Pro Ala
    210                 215                 220

Ser Val Ala Ala Glu Thr Pro Ala Pro Val Ala Lys Val Ala Pro Val
225                 230                 235                 240

Arg Thr Val Ala Ala Pro Arg Val Ala Ser Val Lys Val Val Thr Pro
                245                 250                 255

Lys Val Glu Thr Gly Ala Ser Pro Glu His Val Ser Ala Pro Ala Val
            260                 265                 270

Pro Val Thr Thr Thr Ser Thr Ala Thr Asp Ser Lys Leu Gln Ala Thr
        275                 280                 285

Glu Val Lys Ser Val Pro Val Ala Gln Lys Ala Pro Thr Ala Thr Pro
    290                 295                 300

Val Ala Gln Pro Ala Ser Thr Thr Asn Ala Val Ala Ala His Pro Glu
305                 310                 315                 320

Asn Ala Gly Leu Gln Pro His Val Ala Ala Tyr Lys Glu Lys Val Ala
                325                 330                 335

Ser Thr Tyr Gly Val Asn Glu Phe Ser Thr Tyr Arg Ala Gly Asp Pro
            340                 345                 350

Gly Asp His Gly Lys Gly Leu Ala Val Asp Phe Ile Val Gly Lys Asn
        355                 360                 365

Gln Ala Leu Gly Asn Glu Val Ala Gln Tyr Ser Thr Gln Asn Met Ala
    370                 375                 380

Ala Asn Asn Ile Ser Tyr Val Ile Trp Gln Gln Lys Phe Tyr Ser Asn
385                 390                 395                 400

Thr Asn Ser Ile Tyr Gly Pro Ala Asn Thr Trp Asn Ala Met Pro Asp
                405                 410                 415

Arg Gly Gly Val Thr Ala Asn His Tyr Asp His Val His Val Ser Phe
            420                 425                 430

Asn Lys



<210> 40 
<211> 232 
<212> PRT
<213> Streptococcus
<400> 40

Met Pro His Leu Ser Lys Glu Ala Phe Lys Lys Gln Ile Lys Asn Gly
1               5                   10                  15

Ile Ile Val Ser Cys Gln Ala Leu Pro Gly Glu Pro Leu Tyr Thr Glu
            20                  25                  30

Ser Gly Gly Val Met Pro Leu Leu Ala Leu Ala Ala Gln Glu Ala Gly
        35                  40                  45

Ala Val Gly Ile Arg Ala Asn Ser Val Arg Asp Ile Lys Glu Ile Gln
    50                  55                  60

Glu Val Thr Asn Leu Pro Ile Ile Gly Ile Ile Lys Arg Glu Tyr Pro
65                  70                  75                  80

Pro Gln Glu Pro Phe Ile Thr Ala Thr Met Thr Glu Val Asp Gln Leu
                85                  90                  95

Ala Ser Leu Asp Ile Ala Val Ile Ala Leu Asp Cys Thr Leu Arg Glu
            100                 105                 110

Arg His Asp Gly Leu Ser Val Ala Glu Phe Ile Gln Lys Ile Lys Gly
        115                 120                 125

Lys Tyr Pro Glu Gln Leu Leu Met Ala Asp Ile Ser Thr Phe Glu Glu
    130                 135                 140

Gly Lys Asn Ala Phe Glu Ala Gly Val Asp Phe Val Gly Thr Thr Leu
145                 150                 155                 160

Ser Gly Tyr Thr Asp Tyr Xaa Arg Gln Glu Glu Gly Pro Asp Ile Glu
                165                 170                 175

Leu Leu Asn Lys Leu Cys Gln Ala Gly Ile Asp Val Ile Ala Glu Gly
            180                 185                 190

Lys Ile His Thr Pro Lys Gln Ala Asn Glu Ile Asn His Ile Gly Val
        195                 200                 205

Ala Gly Ile Val Val Gly Gly Ala Ile Thr Arg Pro Lys Glu Ile Ala
    210                 215                 220

Glu Arg Phe Ile Ser Gly Leu Ser
225                 230



<210> 41 
<211> 39 
<212> PRT 

<213> Streptococcus 



<400> 41 

Met Ser Ile Lys Lys Ser Val Ile Gly Phe Cys Leu Gly Ala Ala Ala
1 5 10 15

Leu Ser Met Phe Ala Cys Val Asp Ser Ser Gln Ser Val Met Ala Ala
20 25 30

Glu Lys Asp Lys Val Glu Ile
35



<210> 42 
<211> 1305 
<212> DNA 

<213> Streptococcus



<400> 42 

atgaaaatga ataaaaaggt actattgaca tcgacaatgg cagcttcgct attatcagtc 60 

gcaagtgttc aagcacaaga aacagatacg acgtggacag cacgtactgt ttcagaggta 120 

aaggctgatt tggtaaagca agacaataaa tcatcatata ctgtgaaata tggtgataca 180 

ctaagcgtta tttcagaagc aatgtcaatt gatatgaatg tcttagcaaa aattaataac 240 

attgcagata tcaatcttat ttatcctgag acaacacnga cagtaactta cgatcagaag 300 

agtcatactg ccacttcaat gaaaatagaa acaccagcaa caaatgctgc tggtcaaaca 360 

acagctactg tggatttgaa aaccaatcaa gtttctgttg cagaccaaaa agtttctctc 420

aatacaattt cggaaggtat gacaccagaa gcagcaacaa cgattgtttc gccaatgaag 480

acatattctt ctgcgccagc tttgaaatca aaagaagtat tagcacaaga gcaagctgtt 540

agtcaagcag cagctaatga acaggtatca acagctcctg tgaagtcgat tacttcagaa 600

gttccagcag ctaaagagga agttaaacca actcagacgt cagtcagtca gtcaacaaca 660

gtatcaccag cttctgttgc cgctgaaaca ccagctccag tagctaaagt agcaccggta 720

agaactgtag cagcccctag agtggcaagt gttaaagtag tcactcccaa agtagaaact 780

ggtgcatcac cagagcatgt atcagctcca gcagttcctg tgactacgac ttcaacagct 840

acagacagta agttacaagc gactgaagtt aagagcgttc cggtagcaca aaaagctcca 900

acagcaacac cggtagcaca accagcttca acaacaaatg cagtagctgc acatcctgaa 960

aatgcagggc tccaacctca tgttgcagct tataaagaaa aagtagcgtc aacttatgga 1020

gttaatgaat tcagtacata ccgtgcaggt gatccaggtg atcatggtaa aggtttagca 1080

gtcgacttta ttgtaggtaa aaaccaagca cttggtaatg aagttgcaca gtactctaca 1140

caaaatatgg cagcaaataa catttcatat gttatctggc aacaaaagtt ttactcaaat 1200

acaaatagta tttatggacc tgctaatact tggaatgcaa tgccagatcg tggtggcgtt 1260

actgccaacc attatgacca tgttcacgta tcatttaaca aataa 1305

<210> 43
<211> 1230
<212> DNA

<213> Streptococcus

<400> 43

caagaaacag atacgacgtg gacagcacgt actgtttcag aggtaaaggc tgatttggta 60

aagcaagaca ataaatcatc atatactgtg aaatatggtg atacactaag cgttatttca 120

gaagcaatgt caattgatat gaatgtctta gcaaaaatta ataacattgc agatatcaat 180

cttatttatc ctgagacaac actgacagta acttacgatc agaagagtca tactgccact 240

tcaatgaaaa tagaaacacc agcaacaaat gctgctggtc aaacaacagc tactgtggat 300

ttgaaaacca atcaagtttc tgttgcagac caaaaagttt ctctcaatac aatttcggaa 360

ggtatgacac cagaagcagc aacaacgatt gtttcgccaa tgaagacata ttcttctgcg 420

ccagctttga aatcaaaaga agtattagca caagagcaag ctgttagtca agcagcagct 480

aatgaacagg tatcaacagc tcctgtgaag tcgattactt cagaagttcc agcagctaaa 540

gaggaagtta aaccaactca gacgtcagtc agtcagtcaa caacagtatc accagcttct 600

gttgccgctg aaacaccagc tccagtagct aaagtagcac cggtaagaac tgtagcagcc 660

cctagagtgg caagtgttaa agtagtcact cctaaagtag aaactggtgc atcaccagag 720

catgtatcag ctccagcagt tcctgtgact acgacttcaa cagctacaga cagtaagtta 780

caagcgactg aagttaagag cgttccggta gcacaaaaag ctccaacagc aacaccggta 840

gcacaaccag cttcaacaac aaatgcagta gctgcacatc ctgaaaatgc agggctccaa 900

cctcatgttg cagcttataa agaaaaagta gcgtcaactt atggagttaa tgaattcagt 960

acataccgtg caggtgatcc aggtgatcat ggtaaaggtt tagcagtcga ctttattgta 1020

ggtaaaaacc aagcacttgg taatgaagtt gcacagtact ctacacaaaa tatggcagca 1080

aataacattt catatgttat ctggcaacaa aagttttact caaatacaaa tagtatttat 1140

ggacctgcta atacttggaa tgcaatgcca gatcgtggtg gcgttactgc caaccattat 1200

gaccatgttc acgtatcatt taacaaataa 1230

<210> 44
<211> 409
<212> PRT

<213> Streptococcus
<400> 44

Gln Glu Thr Asp Thr Thr Trp Thr Ala Arg Thr Val Ser Glu Val Lys
1 5 10 15

Ala Asp Leu Val Lys Gln Asp Asn Lys Ser Ser Tyr Thr Val Lys Tyr
20 25 30

Gly Asp Thr Leu Ser Val Ile Ser Glu Ala Met Ser Ile Asp Met Asn
35 40 45

Val Leu Ala Lys Ile Asn Asn Ile Ala Asp Ile Asn Leu Ile Tyr Pro
50 55 60

Glu Thr Thr Leu Thr Val Thr Tyr Asp Gln Lys Ser His Thr Ala Thr
65 70 75 80

Ser Met Lys Ile Glu Thr Pro Ala Thr Asn Ala Ala Gly Gln Thr Thr
85 90 95

Ala Thr Val Asp Leu Lys Thr Asn Gln Val Ser Val Ala Asp Gln Lys
100 105 110

Val Ser Leu Asn Thr Ile Ser Glu Gly Met Thr Pro Glu Ala Ala Thr
115 120 125

Thr Ile Val Ser Pro Met Lys Thr Tyr Ser Ser Ala Pro Ala Leu Lys
130 135 140

Ser Lys Glu Val Leu Ala Gln Glu Gln Ala Val Ser Gln Ala Ala Ala
145 150 155 160

Asn Glu Gln Val Ser Thr Ala Pro Val Lys Ser Ile Thr Ser Glu Val
165 170 175

Pro Ala Ala Lys Glu Glu Val Lys Pro Thr Gln Thr Ser Val Ser Gln
180 185 190

Ser Thr Thr Val Ser Pro Ala Ser Val Ala Ala Glu Thr Pro Ala Pro
195 200 205

Val Ala Lys Val Ala Pro Val Arg Thr Val Ala Ala Pro Arg Val Ala
210 215 220

Ser Val Lys Val Val Thr Pro Lys Val Glu Thr Gly Ala Ser Pro Glu
225 230 235 240

His Val Ser Ala Pro Ala Val Pro Val Thr Thr Thr Ser Thr Ala Thr
245 250 255

Asp Ser Lys Leu Gln Ala Thr Glu Val Lys Ser Val Pro Val Ala Gln
260 265 270

Lys Ala Pro Thr Ala Thr Pro Val Ala Gln Pro Ala Ser Thr Thr Asn
275 280 285

Ala Val Ala Ala His Pro Glu Asn Ala Gly Leu Gln Pro His Val Ala
290 295 300

Ala Tyr Lys Glu Lys Val Ala Ser Thr Tyr Gly Val Asn Glu Phe Ser
305 310 315 320

Thr Tyr Arg Ala Gly Asp Pro Gly Asp His Gly Lys Gly Leu Ala Val
325 330 335

Asp Phe Ile Val Gly Lys Asn Gln Ala Leu Gly Asn Glu Val Ala Gln
340 345 350

Tyr Ser Thr Gln Asn Met Ala Ala Asn Asn Ile Ser Tyr Val Ile Trp
355 360 365

Gln Gln Lys Phe Tyr Ser Asn Thr Asn Ser Ile Tyr Gly Pro Ala Asn
370 375 380

Thr Trp Asn Ala Met Pro Asp Arg Gly Gly Val Thr Ala Asn His Tyr
385 390 395 400

Asp His Val His Val Ser Phe Asn Lys
405